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POSH INTERACTING PROTEINS AND RELATED METHODS 

RELATED APPLICATIONS 

This application claims the benefit of priority of U.S. Provisional 
Application number 60/451,437 filed 3 March 2003; 60/452,284 filed 5 March 
5 2003; 60/456,640 filed 20 March 2003; 60/460,526 filed 3 April 2003; 60/464,285 
filed 21 April 2003; 60/469,462 filed 9 May 2003; 60/471,378 filed 15 May 2003; 
60/472,327 filed 20 May 2003; 60/474,706 filed 30 May 2003; 60/475,825 filed 3 
June 2003; 60/479,317 filed 17 June 2003; 60/480,376 filed 19 June 2003; 
60/480,215 filed 19 June 2003; 60/493,860 filed 8 August 2003; 60/503,931 filed 16 

10 September 2003; 60/455,760 filed 19 March 2003; 60/460,792 filed 4 April 2003; 
60/498,634 filed 28 August 2003; and a provisional application filed on March 2, 
2004, (Attorney Docket No. PROL-P79-024), in the name of Daniel N. Taglicht, Iris 
Alroy, Yuval Reiss, Liora Yaar, Danny Ben-Avraham, Shmuel Tuvia, and Tsvika 
Greener entitled "Posh Interacting Proteins and Related Methods"; a PCT 

15 application US03/35712 filed 10 November 2003; and a PCT application filed on 
February 5, 2004, (Attorney Docket No. PROL-PWO-039), in the name of Iris 
Alroy, Daniel Taglicht, Yuval Reiss, Liora Yaar, and Shmuel Tuvia entitled "Posh 
Associated Kinases and Related Methods". The teachings of the referenced 
Applications are incorporated herein by reference in their entirety. 

20 

BACKGROUND 

Potential drug target validation involves determining whether a DNA, RNA 
or protein molecule is implicated in a disease process and is therefore a suitable 
target for development of new therapeutic drugs. Drug discovery, the process by 

25 which bioactive compounds are identified and characterized, is a critical step in the 
development of new treatments for human diseases. The landscape of drug 
discovery has changed dramatically due to the genomics revolution. DNA and 
protein sequences are yielding a host of new drug targets and an enormous amount 
of associated information. 

30 The identification of genes and proteins involved in various disease states or 

key biological processes, such as inflammation and immune response, is a vital part 
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of the drug design process. Many diseases and disorders could be treated or 
prevented by decreasing the expression of one or more genes involved in the 
molecular etiology of the condition if the appropriate molecular target could be 
identified and appropriate antagonists developed. For example, cancer, in which one 
5 or more cellular oncogenes become activated and result in the unchecked 
progression of cell cycle processes, could be treated by antagonizing appropriate cell 
cycle control genes. Furthermore many human genetic diseases, such as 
Huntington's disease, and certain prion conditions, which are influenced by both 
genetic and epigenetic factors, result from the inappropriate activity of a polypeptide 
10 as opposed to the complete loss of its function. Accordingly, antagonizing the 
aberrant function of such mutant genes would provide a means of treatment. 
Additionally, infectious diseases such as HIV have been successfully treated with 
molecular antagonists targeted to specific essential retroviral proteins such as HIV 
protease or reverse transcriptase. Drag therapy strategies for treating such diseases 
15 and disorders have frequently employed molecular antagonists which target the 
polypeptide product of the disease gene(s). However, the discovery of relevant gene 
or protein targets is often difficult and time consuming. 

One area of particular interest is the identification of host genes and proteins 
that are co-opted by viruses during the viral life cycle. The serious and incurable 
20 nature of many viral diseases, coupled with the high rate of mutations found in many 
viruses, makes the identification of antiviral agents a high priority for the 
improvement of world health. Genes and proteins involved in a viral life cycle are 
also appealing as a subject for investigation because such genes and proteins will 
typically have additional activities in the host cell and may play a role in other non- 
25 viral disease states. 

Other areas of interest include the identification of genes and proteins 
involved in cancer, apoptosis and neural disorders (particularly those associated with 
apoptotic neurons, such as Alzheimer's disease). 

It would be beneficial to identify proteins involved in one or more of these 
30 processs for use in, among other things, drug screening methods. Additionally, once 
a protein involved in one or more processes of interest has been identified, it is 
possible to identify proteins that associate, directly or indirectly, with the initially 
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identified protein. Knowledge of interactors will provide insight into protein 
assemblages and pathways that participate in disease processes, and in many cases 
an interacting protein will have desirable properties for the targeting of therapeutics. 
In some cases, an interacting protein will already be known as a drug target, but in a 
5 different biological context. Thus, by identifying a suite of proteins that interact 
with an initially identified protein, it is possible to identify novel drug targets and 
new uses for previously known therapeutics. 

SUMMARY 

10 This application provides isolated, purified or recombinant complexes 

comprising a POSH polypeptide and one or more POSH-associated protein (POSH- 
AP). In certain aspects, the POSH-AP comprises a polypeptide selected from the 
group consisting of: PKA, SNX1, SNX3, ATP6V0C, PTPN12, PPP1CA, GOSR2, 
CENTB1, DDEF1, ARF1, ARF5, PACS-1, EPS8L2, HERPUD1, UNC84B, 

15 MSTP028, GOCAP, EEF3S3, SRA1, CBL-B, RALA, SIAH1, SMN1, SMN2, 

SYNE1, TTC3, VCY2IP1 and UBE2N (UBC13). In other aspects, the POSH-AP 
comprises a polypeptide selected from the group consisting of: ARHV (Chp), 
WASF1, HIP55, SPG20, HLA-A, and HLA-B. In further aspects, the POSH-AP 
comprises one or more polypeptides set forth in Table 8. In certain embodiments the 

20 POSH polypeptide is a human POSH polypeptide. 

In certain embodiments, this application provides isolated, purified or 
recombinant complexes comprising a HERPUD1 polypeptides and a ubiquitin 
ligase, examples of the ubiquitin ligase include CBL-B, TTC3, and SIAH1. 

In certain embodiments, the application provides methods for identifying 

25 agents that modulates an activity of a POSH polypeptide or POSH-AP, comprising 
identifying an agent that disrupts a complex of a POSH polypeptide and a POSH- 
AP, wherein an agent that disrupts such a complex is an agent that modulates an 
activity of the POSH polypeptide or the POSH-AP. 

In yet other embodiments, the application provides methods of identifying an 

30 antiviral agent, comprising identifying a test agent that disrupts a complex 

comprising a POSH polypeptide and a POSH-AP and evaluating the effect of the 
test agent on either a pro-infective or pro-replicative function of a virus is an 
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antiviral agent, wherein an agent inhibits such a function of a virus is an antiviral 
agent. In certain embodiments the POSH-AP is selected from the group consisting 
of: PKA, SNX1, SNX3, PTPN12, GOSR2, CENTB1, ARF1, ARF5, PACS-1, 
EPS8L2, HERPUD1, SMN1, SMN2, UNC84B, MSTP028, GOCAP, CBL-B, 
5 SYNE1, UBE2N (UBC13), SIAH1, TTC3, WASF1, HIP55, RALA, and SPG20. 
Examples of such viruses include for example, envelope viruses such as the Human 
Immunodeficiency Virus, the West Nile Virus, and the Moloney Murine Leukemia 
Virus (MMuLV). 

In other embodiments, the application provides methods of identifying an 

10 anti-apoptotic agent, comprising identifying a test agent that disrupts a complex 
comprising a POSH polypeptide and a POSH-AP and evaluating the effect of the 
test agent on apoptosis of a cell wherein an agent that decreases apoptosis of the cell 
is an anti-apoptotic agent. In yet other embodiments, the application provides 
methods of identifying an anti-cancer agent, comprising identifying a test agent that 

1 5 disrupts a complex comprising a POSH polypeptide and a POSH-AP and evaluating 
the effect of the test agent on proliferation or survival of a cancer cell, wherein an 
agent that decreases proliferation or survival of a cancer cell is an anti-cancer agent. 
Examples of the POSH-AP include PKA, SNX1, PTPN12, PPP1CA, ARF1, ARE5, 
CENTB1, EPS8L2, EEF3S3, CBL-B, RALA, SIAH1, TTC3, ATP6V0C, and 

20 VCY2IP1. In certain embodiments, the cancer is a POSH-associated cancer. 

In certain aspects, the application provides methods of identifying an agent 
that inhibits trafficking of a protein through the secretory pathway, comprising 
identifying a test agent that disrupts a complex comprising a POSH polypeptide and 
a POSH-AP and evaluating the effect of the test agent on the trafficking of a protein 

25 through the secretory pathway wherein an agent that disrupts localization of said 
POSH-AP is an agent that inhibits trafficking of a protein through the secretory 
pathway. In certain embodiments, the protein is a myristoylated protein. In yet 
other embodiments, the protein is a viral protein. In alternative embodiments, the 
protein is associated with a neurological disorder such as for example the amyloid 

30 beta precursor protein. 

In yet other embodiments, the application provides methods of identifying an 
agent that inhibits the progression of a neurological disorder, comprising identifying 
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a test agent that disrupts a complex comprising a POSH polypeptide and a POSH- 
AP evaluating the effect of the test agent on the trafficking of a protein through the 
secretory pathway wherein an agent that disrupts localization of a POSH-AP is an 
agent that inhibits progression of a neurological disorder. In certain aspects the 
5 POSH-AP is HERPUD1 . 

In yet other embodiments, this application provides methods of treating a 
viral infection in a subject in need thereof, comprising administering an agent that 
inhibits a POSH-AP in an amount sufficient to inhibit the viral infection. The agent 
is one that: inhibits a kinase activity of the POSH-AP; inhibits expression of the 

10 POSH-AP; inhibits the ubiquitin ligase activity of the POSH-AP; inhibits the 

phosphatase activity of the POSH-AP; inhibits the GTPase activity of the POSH- 
AP; and inhibits the ubiquitination of the POSH-AP. In certain embodiments, the 
POSH-AP comprises a polypeptide selected from the group consisting of: PKA, 
SNX1, SNX3, SMN1, SMN2, PTPN12, GOSR2, CENTB1, ARF1, ARF5, PACS-1, 

15 EPS8L2, HERPUD1, TJNC84B, MSTP028, GOCAP, CBL-B, SYNE1, UBE2N 
(UBC13), SIAH1, TTC3, WASF1, HIP55, RALA, and SPG20. In certain aspects, 
the agent may be an siRNA construct, a small molecule, an antibody, or an antisense 
construct. 

In certain embodiments, the agent is an siRNA construct comprising a 
20 nucleic acid sequence that hybridizes to an mRNA encoding the POSH-AP. 

Examples include siRNA constructs that inhibit the expression of HERPUD1 or 
MSTP028. Examples of siRNA constructs that inhibit the expression of HERPUD1 
include: S'GGAAGUUCUUCGGAACCUdTdT-S' and 

5'- dTdTCCCUUCAAGAAGCCUUGGA-5'. Examples of siRNA constructs that 
25 inhibit the expression of MSTP028 include: 5 ' - AAGTGCTC ACCGAC AGTGAAG- 

3' and 5 AAGAT ACTTATG AGCCTTTCT-3 ' . 

In other aspects, the agents may be a small molecule inhibitor is selected from 

among the following categories: adenosine cyclic monophosphorothioate, 

isoquinolinesulfonamide, piperazine, piceatannol, and ellagic acid. In alternative 
30 embodiments, the agents may be a small molecule inhibitor that inhibits the ligase 

activity of a POSH polypeptide or inhinbits the ubiquitination of a POSH-AP; 

Examples of such small molecules include, for example: 
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10 In certain embodiments, the application provides packaged pharmaceuticals 

for treating viral infections, comprising: a pharmaceutical composition comprising 
an inhibitor of a POSH-AP and a pharmaceutical^ acceptable carrier and 
instructions for use. 

In certain embodiments, the application provides methods of treating or 

1 5 preventing a POSH associated cancer in a subject comprising administering an agent 
that inhibits a POSH-AP to a subject in need thereof, wherein said agent treats or 
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prevents cancer. The POSH-AP comprises a polypeptide selected from the group 
consisting of: PKA, SNX1, PTPN12, PPP1CA, CENTB1, ARF1, ARF5, EPS8L2, 
EIF3S3, CBL-B, RALA, SIAH1, TTC3, ATP6V0C, and VCY2IP1. 

In yet other aspects, the application provides methods of treating a 
5 neurological disorder comprising administering an agent to a subject in need thereof, 
wherein said agent either inhibits the Ubiquitin ligase activity of POSH or inhibits 
the ubquitination of a POSH-AP. Examples of the POSH-AP include: PTPN12, 
DDEF1, EPS8L2, HERPUD1, GOCAP, CBL-B, SIAH1, SMN1, SMN2, TTC3, 
SPG20, SNX1, and ARF1. 
10 Examples of the neurological disorders include Alzheimer's disease, Parkinson's 

disease, Huntington's disease, schizophrenia, Niemann-Pick's disease, and prion- 
associated diseases. In certain aspects, the agent is selected from the group 
consisting of: an siRNA construct, a small molecule, an antibody, and an antisense 
construct. Examples of the small molecules include: 

15 

O 




-7- 



WO 2004/078130 



PCT/US2004/006308 




5 

In certain aspects, the disclosure provides methods of treating viral hepatitis 
in a subject in need thereof. Such a method may comprise administering an 
effective amount of an agent that inhibits POSH or disrupts an interaction between 
POSH and a dynamin, preferably dynamin II. In certain embodiments, the subject 

10 has a viral hepatitis caused by HBV or HCV. 

In certain aspects, the disclosure provides methods of inhibiting a 
hepatotrophic virus or a method for treating a disease associated with a hepatrophic 
virus, comprising administering an effective amount of an agent, wherein said agent 
inhibits POSH or an interaction between POSH and dynamin. In certain 

15 embodiments, the hepatrophic virus is selected from the group consisting of HAV, 
HBV, HCV, HDV, and HEV. The hepatotrophic virus associated disease may be, 
for example, viral hepatitis or hepatocellular carcinoma. An agent for any of the 
above methods may include, for example, a nucleic acid agent that decreases the 
level of POSH in cells of the subject (e.g., an antisense oligonucleotide, an RNAi 

20 construct, a DNA enzyme, a ribozyme) or small molecule inhibitors of POSH, as 
well as antibodies or other binding agents that bind to a surface of POSH or 
dynamin that participates in a POSH-dynamin interaction. An agent may be any of 
the following: a small molecule, an antibody, a fragment of an antibody, a 
peptidomimetic, and a polypeptide. Examples of small molecules include: 
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STRUCTURE 


MW 


CAS number 


F 

1 F 

•> 


384.2 


14567-55-4 


Br O 

Br 


389.5 


414908-38-0 



In certain embodiments, the application provides methods for inhibiting an 



5 HBV infection in a subject in need thereof, comprising administering an effective 
amount of a POSH inhibitor, wherein the HBV infection is inhibited in the subject. 
In additional embodiments, the disclosure provides methods for treating an HBV 
infection in a patient, comprising administering an effective amount of an agent that 
inhibits POSH or decreases the level of POSH protein or nucleic acid in an infected 

10 cell. An agent may be, for example, an RNAi construct that inhibits the expression 
of POSH. Optionally the RNAi construct is 20-25 nucleotides in length and 
optionally it is selected from any one of SEQ ED NOS: 15, 16, 18, 19, 21, 22, 24, 
and 25. The RNAi may be formulated as a liposome. An agent may be a small 
molecule inhibitor of POSH ubiquitin ligase activity, as disclosed herein. Examples 

15 of small molecule inhibitors of POSH include: 



-9- 



WO 2004/078130 



PCT/US2004/006308 



STRUCTURE 


MW 


CAS number 




384.2 


14567-55-4 


Br O 

Br 


389.5 


414908-38-0 



In certain aspects, the disclosure provides a method for treating an HBV 
infection in a patient, comprising administering an effective amount of an antisense 
5 oligonucleotide sufficient to bind a nucleic acid molecule, which nucleic acid 
molecule encodes a POSH polypeptide. 

In certain embodiments, the application provides methods for inhibiting an 
HBV infection by administering an effective amount of a compound of the formula: 



F 




10 In additional embodiments, the application provides methods for treating an 

HBV infection by administering an effective amount of a compound of the formula: 
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Br 



In certain aspects, the disclosure provides methods for inhibiting the 
maturation of a lentivirus by modulating the activity of a Vpu polypeptide. In 
preferred embodiments, maturation of the lentivirus is inhibited by inhibiting the 
5 transport and/or assembly of viral particles in the TGN and from the TGN to the 
plasma membrane. A preferred lentivirus for application of such a method is the 
human immunodeficiency virus. 

In certain aspects, the disclosure provides methods of inhibiting viral 
infection comprising administering an agent to a subject in need thereof, wherein 
10 said agent inhibits the interaction between a POSH polypeptide and Vpu. 

In certain aspects, the disclosure provides methods for identifying a target 
polypeptide for antiviral therapy, the method comprising: a) selecting a test 
polypeptide known to localize or predicted to localize to the trans Golgi network; b) 
inhibiting an activity of the test polypeptide in a cell infected with a viral construct 
15 under conditions where, but for the inhibition of the activity of the test polypeptide, 
viral particles are released from the cell; and c) determining whether viral particles 
are released from the cell, wherein, if inhibiting the activity of the test polypeptide in 
the cell inhibits the release of viral particles from the cell, the test polypeptide is a 
target polypeptide for antiviral therapy. In a preferred embodiment, the test 
20 polypeptide is Vpu. Vpu activity may be inhibited, for example, by siRNA, 
antisense or other nucleic acid based method. 

In certain aspects, the disclosure provides isolated, purified or recombinant 
complexes comprising a POSH polypeptide and a Vpu polypeptide. The POSH 
polypeptide may comprise, for example, a POSH SH3 domain, or a polypeptide at 
25 least 80% identical to such an SH3 domain. An antiviral agent may be selected 
based on its ability to disrupt a POSH-Vpu complex. 

The practice of the present application will employ, unless otherwise 
indicated, conventional techniques of cell biology, cell culture, molecular biology, 
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transgenic biology, microbiology, recombinant DNA, and immunology, which are 
within the skill of the art. Such techniques are explained fully in the literature. See, 
for example, Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, 
Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, 
5 Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait 
ed., 1984); Mullis et aL U.S. Patent No: 4,683,195; Nucleic Acid Hybridization (B. 
D. Hames & S. J. Higgins eds. 1984); Transcriptioji And Translation (B. D. Hames 
& S. J. Higgins eds. 1984); Culture Of Animal Cells (R. L Freshney, Alan R. Liss, 
Inc., 1987); Immobilized Cells And Enzymes (ERL Press, 1986); B. Perbal, A 

10 Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology 
(Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. 
Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In 
Enzymology, Vols. 154 and 155 (Wu et al. eds.), Immunochemical Methods In Cell 
And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); 

15 Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. 
Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

Other features and advantages of the application will be apparent from the 
following detailed description, and from the claims. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows human POSH coding sequence (SEQ ID NO:l). 
Figure 2 shows human POSH amino acid sequence (SEQ ID NO:2). 
Figure 3 shows human POSH cDNA sequence (SEQ ID NO:3). 
25 Figure 4 shows 5' cDNA fragment of human POSH (public gi:10432611; 

SEQ ID NO:4). 

Figure 5 shows N terminus protein fragment of hPOSH (public gi:10432612; 
SEQIDNO:5). 

Figure 6 shows 3' mRNA fragment of hPOSH (public gi:7959248; SEQ ID 

30 NO:6). 

Figure 7 shows C terminus protein fragment of hPOSH (public gi:7959249; 
SEOIDNO:7). 
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Figure 8 shows human POSH full mRNA, annotated sequence. 
Figure 9 shows domain analysis of human POSH. 

Figure 10 is a diagram of human POSH nucleic acids. The diagram shows 
the full-length POSH gene and the position of regions amplified by RT-PCR or 
5 targeted by siRNA used in figure 1 1 . 

Figure 11 shows effect of knockdown of POSH mRNA by siRNA duplexes. 
HeLa SS-6 cells were transfected with siRNA against Lamin A/C (lanes 1 , 2) or 
POSH (lanes 3-10). POSH siRNA was directed against the coding region (153 - 
lanes 3, 4; 155 - lanes 5, 6) or the 3'UTR (157 - lanes 7, 8; 159 - lanes 9, 10). Cells 
10 were harvested 24 hours post-transfection, RNA extracted, and POSH mRNA levels 
compared by RT-PCR of a discrete sequence in the coding region of the POSH gene 
(see figure 10). GAPDH is used an RT-PCR control in each reaction. 

Figure 12 shows that POSH affects the release of VLP from cells. A) 
Phosphohimages of SDS-PAGE gels of immunoprecipitations of 35 S pulse-chase 
15 labeled Gag proteins are presented for cell and viral lysates from transfected HeLa 
cells that were either untreated or treated with POSH RNAi (50 nM for 48 hours). 
The time during the chase period (1, 2, 3, 4, and 5 hours after the pulse) are 
presented from left to right for each image. 

Figure 13 shows release of VLP from cells at steady state. Hela cells were 
20 transfected with an HIV-encoding plasmid and siRNA. Lanes 1, 3 and 4 were 
transfected with wild-type HIV-encoding plasmid. Lane 2 was transfected with an 
HIV-encoding plasmid which contains a point mutation in p6 (PTAP to ATAP). 
Control siRNA (lamin A/C) was transfected to cells in lanes 1 and 2. siRNA to 
TsglOl was transfected in lane 4 and siRNA to POSH in lane 3. 
25 Figure 14 shows mouse POSH mRNA sequence (public gi:10946921; SEQ 

ID NO: 8). 

Figure 15 shows mouse POSH Protein sequence (Public gi:10946922; SEQ 
ID NO: 9). 

Figure 16s hows D rosophila m elanogaster P OSH m RNA s equence (public 
30 gi:17737480; SEQ ID NO: 10). 

Figure 17 shows Drosophila melanogaster POSH protein sequence (public 
gi: 17737481; SEQ ID NO: 11). 
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Figure 18 shows POSH domain analysis. 

Figure 19 shows that human POSH has ubiquitin ligase activity. 
Figure 20 shows that human POSH co-immunoprecipitates with RAC1 . 
Figure 21 shows that POSH knockdown results in decreased secretion of 
5 phospholipase D ('TLB"). 

Figure 22 shows effect of hPOSH on Gag-EGFP intracellular distribution. 
Figure 23 shows intracellular distribution of HIV-1 Nef in hPOSH-depleted 

cells. 

Figure 24 shows intracellular distribution of Src in hPOSH-depleted cells. 
10 Figure 25 shows intracellular distribution of Rapsyn in hPOSH-depleted 

cells. 

Figure 26 shows that POSH reduction by siRNA abrogates West Nile virus 
infectivity. 

Figure 2? shows that POSH knockdown decreases the release of extracellular 
15 MMuLV particles. i 

Figure 28 shows that knock-down of human POSH entraps HIV virus 
particles in intracellular vesicles. HIV virus release was analyzed by electron 
microscopy following siRNA and full-length HIV plasmid transfection. Mature 
viruses were secreted by cells transfected with HIV plasmid and non-relevant siRNA 
20 (control, bottom panel). Knockdown of TsglOl protein resulted in a budding defect, 
the viruses that were released had an immature phenotype (top panel). Knockdown 
of hPOSH levels resulted in accumulation of viruses inside the cell in intracellular 
vesicles (middle panel). 

Figure 29 A shows siRNA-mediated reduction of MSTP028 expression 
25 inhibits HIV virus-like particle production (Experiment 1). 

Figure 29B shows siRNA-mediated reduction of MSTP028 expression 
inhibits HIV virus-like particle production (Experiment 2). 

Figure 30 shows putative PKA phosphorylation sites in hPOSH. Amino acid 
sequence of hPOSH (70 residues per line). Motifs of the low stringency RxxS/T 
30 type are underlined. The high stringency motif R/KR/KxS/T is bordered. Putative 
S/T phosphorylation sites are highlighted in green. Color-coding of domains: Red - 
RING, Blue - SH3, Green - putative Rac-1 Binding Domain. 
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Figure 31 shows that phosphorylation of hPOSH regulates binding of GTP- 
ioaded Rac-1. Bacterially expressed hPOSH (1 ^g) (POSH) or GST (1 fig) (NS) 
were phosphorylated. Subsequently, GTPyS loaded or unloaded recombinant Rac-1 
(0.2 \ig) was added to hPOSH or GST. Bound racl was isolated as described in 
5 materials and methods and samples separated by SDS-PAGE on a 12% gel and 
immunobloted with anti-Rac-1. Input is 0.25 ^ig of Rac-1. 

Figure 32 shows domain analysis of various POSH-APs. 

Figure 33 shows siRNA-mediated reduction in HERPUD1 expression 
reduces HTV maturation. 
10 Figure 34A shows that endogenous Herp levels are reduced in HI 53 cells. 

HI 53 (POSH-RNAi) and HI 87 (control RNAi) cells were transfected with aplasmid 
encoding Flag-ubiquitin. Total cell lysates (A) or Flag-immunoprecipitated material 
(B) were separated on 10% SDS-PAGE and immunoblotted with anti-Herp 
antibodies. 

15 Figure 34B shows that exogenous Herp levels and its ubiquitination are 

reduced in POSH-depleted cells. HI 53 and HI 87 cells were co-transfected with 
Herp or control plasmids and a plasmid encoding Flag-ubiquitin (indicated above the 
figure). Total (A) and flag-immunoprecipitated material (B) were separated on 10% 
SDS-PAGE and immunoblotted with anti-Herp antibodies. 

20 Figure 35 shows that the compounds CAS number 14567-55-4 and CAS 

number 414908-38-0 (lanes 7 and 8) inhibit HBV production. 

Figure 36 provides the nucleic acid and amino acid sequences of POSH-APs. 

DETAILED DESCRIPTION OF THE APPLICATION 
25 1. Definitions 

The term "binding" refers to a direct association between two molecules, due 
to, for example, covalent, electrostatic, hydrophobic, ionic and/or hydrogen-bond 
interactions under physiological conditions. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid 
sequence encoding a polypeptide with a second amino acid sequence defining a 
domain foreign to and not substantially homologous with any domain of the first 
amino acid sequence. A chimeric protein may present a foreign domain which is 
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found ( albeit i n a different protein) in an organism which also expresses the first 
protein, or it may be an "interspecies", "intergenic", etc. fusion of protein structures 
expressed by different kinds of organisms. 

The terms "compound", 'test compound" and "molecule" are used herein 
5 interchangeably and are meant to include, but are not limited to, peptides, nucleic 
acids, carbohydrates, small organic molecules, natural product extract libraries, and 
any other molecules (including, but not limited to, chemicals, metals and 
organometallic compounds). 

The phrase "conservative amino acid substitution" refers to grouping of 

10 amino acids on the basis of certain common properties. A functional way to define 
common properties between individual amino acids is to analyze the normalized 
frequencies of amino acid changes between corresponding proteins of homologous 
organisms (Schulz, G. E. and R. H. Schirmer., Principles of Protein Structure, 
Springer- Verlag). According to such analyses, groups of amino acids may be 

15 defined where amino acids within a group exchange preferentially with each other, 
and therefore resemble each other most in their impact on the overall protein 
structure (Schulz, G. E. and R. H. Schirmer, Principles of Protein , Structure, 
Springer- Verlag). Examples of amino acid groups defined in this manner include: 
(i) a charged group, consisting of Glu and Asp, Lys, Arg and His, 

20 (ii) • a positively-charged group, consisting of Lys, Arg and His, 

(iii) a negatively-charged group, consisting of Glu and Asp, 

(iv) an aromatic group, consisting of Phe, Tyr and Trp, 

(v) a nitrogen ring group, consisting of His and Trp, 

(vi) a large aliphatic nonpolar group, consisting of Val, Leu and He, 
25 (vii) a slightly-polar group, consisting of Met and Cys, 

(viii) a small-residue group, consisting of Ser, Thr, Asp, Asn, Gly, Ala, Glu, Gin 
and Pro, 

(ix) an aliphatic group consisting of Val, Leu, lie, Met and Cys, and 

(x) a small hydroxyl group consisting of Ser and Thr. 

30 In addition to the groups presented above, each amino acid residue may form 

its own group, and the group formed by an individual amino acid may be referred to 
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simply by the one and/or three letter abbreviation for that amino acid commonly 
used in the art. 

A "conserved residue" is an amino acid that is relatively invariant across a 
range of similar proteins. Often conserved residues will vary only by being replaced 
5 with a similar amino acid, as described above for "conservative amino acid 
substitution". 

The term "domain" as used herein refers to a region of a protein that 
comprises a particular structure and/or performs a particular function. 

The term "envelope virus" as used herein refers to any virus that uses cellular 

10 membrane and/or any organelle membrane in the viral release process. 

"Homology" or "identity" or "similarity" refers to sequence similarity 
between two peptides or between two nucleic acid molecules. Homology and 
identity c an each be determined by comparing a position in each sequence which 
may b e a ligned for p urposes o f c omparison. W hen an equivalent p osition i n t he 

15 compared sequences is occupied by the same base or amino acid, then the molecules 
are i dentical a 1 1 hat p osition; w hen t he e quivalent s ite o ccupied b y t he s ame o r a 
similar amino acid residue (e.g., similar in steric and/or electronic nature), then the 
molecules can be referred to as homologous (similar) at that position. Expression as 
a percentage of homology/similarity or identity refers to a function of the number of 

20 identical or similar amino acids at positions shared by the compared sequences. A 
sequence which is "unrelated" or "non-homologous" shares less than 40% identity, 
though preferably less than 25% identity with a sequence of the present application. 
In comparing two sequences, the absence of residues (amino acids or nucleic acids) 
or presence of extra residues also decreases the identity and homology/similarity. 

25 The term "homology" describes a mathematically based comparison of 

sequence similarities which is used to identify genes or proteins with similar 
functions or motifs. The nucleic acid and protein sequences of the present 
application may be used as a "query sequence" to perform a search against public 
databases to, for example, identify other family members, related sequences or 

30 homologs. Such searches can be performed using the NBLAST and XBLAST 
programs (version 2.0) of Altschul, et al. (1990) J Mol. Biol. 215:403-10. BLAST 
nucleotide searches can be performed with the NBLAST program, score=100, 
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wordlength=12 to obtain nucleotide sequences homologous to nucleic acid 
molecules of the application. BLAST protein searches can be performed with the 
XBLAST program, score=50, wordlength=3 to obtain amino acid sequences 
homologous to protein molecules of the application. To obtain gapped alignments 
5 for comparison purposes, Gapped BLAST can be utiUzed as described in Altschul et 
al., (1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and 
Gapped BLAST programs, the default parameters of the respective programs (e.g., 
XBLAST and BLAST) can be used. See http://www.ncbi.nlm.nih.gov. 

As used herein, "identity" means the percentage of identical nucleotide or 

10 amino acid residues at corresponding positions in two or more sequences when the 
sequences are aligned to maximize sequence matching, i.e., taking into account gaps 
and insertions. Identity can be readily calculated by known methods, including but 
not limited to those described in (Computational Molecular Biology, Lesk, A. M., 
ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and 

15 Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana 
Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G.» 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 
J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM 

20 J. Applied Math., 48: 1073 (1988). Methods to determine identity are designed to 
give the largest match between the sequences tested. Moreover, methods to 
determine identity are codified in publicly available computer programs. Computer 
program methods to determine identity between two sequences include, but are not 
limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 

25 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Altschul, S. F. et al., J. 
Molec. Biol. 215: 403-410 (1990) and Altschul et al. Nuc. Acids Res. 25: 3389-3402 
(1997)). The BLAST X program is publicly available from NCBI and other sources 
(BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, Md. 20894; 
Altschul, S., et al., J. Mol. Biol. 215: 403-410 (1990). The well known Smith 

30 Waterman algorithm may also be used to determine identity. 

The term "isolated", as used herein with reference to the subject proteins and 
protein complexes, refers to a preparation of protein or protein complex that is 
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essentially free from contaminating proteins that normally would be present with the 
protein or complex, e.g., in the cellular milieu in which the protein or complex is 
found endogenously. Thus, an isolated protein complex is isolated from cellular 
components t hat n ormally would " contaminate" o r i nterfere w ith t he s tudy o f t he 
5 complex in isolation, for instance while screening for modulators thereof. It is to be 
understood, however, that such an "isolated" complex may incorporate other 
proteins the modulation of which, by the subject protein or protein complex, is being 
investigated. 

The term "isolated" as also used herein with respect to nucleic acids, such as 
10 DNA or RNA, refers to molecules in a form which does not occur in nature. 
Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments 
which are not naturally occurring as fragments and would not be found in the natural 
state. 

Lentiviruses include primate lentiviruses, e.g., human immunodeficiency 

15 virus types 1 and 2 (HIV-l/HIV-2); simian immunodeficiency virus (SIV) from 
. Chimpanzee (SIVcpz), Sooty mangabey (SlVsmm), African Green MonJkey 
(SIVagm), S yke ? s m onkey (SIVsyk), M andrill ( SIVmnd) and M acaque (STVmac). 
Lentiviruses also include feline lentiviruses, e.g., Feline immunodeficiency virus 
(FIV); Bovine lentiviruses, e.g., Bovine immunodeficiency virus (BIV); Ovine 

20 lentiviruses, e.g., Maedi/Visna virus (MW) and Caprine arthritis encephalitis virus 
(CAEV); and Equine lentiviruses, e.g., Equine infectious anemia virus (EIAV). All 
lentiviruses express at least two additional regulatory proteins (Tat, Rev) in addition 
to Gag, Pol, and Env proteins. Primate lentiviruses produce other accessory proteins 
including Nef, Vpr, Vpu, Vpx, and Vif. Generally, lentiviruses are the causative 

25 agents of a variety of disease, including, in addition to immunodeficiency, 
neurological degeneration, and arthritis. Nucleotide sequences of the various 
lentiviruses can be found in Genbank under the following Accession Nos. (from J. 
M. Coffin, S. H. Hughes, and H. E. Varmus, "Retroviruses" Cold Spring Harbor 
Laboratory Press, 1 99,7 p 8 04): 1 ) HIV-1: K03455, M 19921, K02013, M3843 1 , 

30 M38429, K02007 and M17449; 2) HIV-2: M30502, J04542, M30895, J04498, 
M15390, M31113 and L07625; 3) SIV:M29975, M30931, M58410, M66437, 
L06042, M33262, M19499, M32741, M31345 and L03295; 4) FIV: M25381, 
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M36968 and Ul 1820; 5)BIV. M32690; 6) E1AV: M16575, M87581 and U01866; 
6) Visna: M10608, M5 1 543, L 06906, M60609 and M60610; 7) CAEV: M33677; 
and 8) Ovine lentivirus M31646 and M34193. Lentiviral DNA can also be obtained 
from the American Type Culture Collection (ATCC). For example, feline 
5 immunodeficiency virus is available under ATCC Designation No. VR-2333 and 
VR-3112. Equine infectious anemia virus A is available under ATCC Designation 
No. VR-778. Caprine arthritis-encephalitis virus is available under ATCC 
Designation No. VR-905. Visna virus is available under ATCC Designation No. 
VR-779. 

10 As used herein, the term "nucleic acid" refers to polynucleotides such as 

deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The 
term should also be understood to include, as equivalents, analogs of either RNA or 
DNA made from nucleotide analogs, and, as applicable to the embodiment being 
described, single-stranded (such as sense or antisense) and double-stranded 

15 polynucleotides. 

The term "maturation" as used herein refers to the production, post- 
translational processing, assembly and/or release of proteins that form a viral 
particle. Accrodingly, this includes the processing of viral proteins leading to the 
pinching off of nascent virion from the cell membrane. 
20 A "POSH nucleic acid" is a nucleic acid comprising a sequence as 

represented in any of SEQ ID Nos:l, 3, 4, 6, 8, and 10 as well as any of the variants 
described herein. 

A "POSH polypeptide" or "POSH protein" is a polypeptide comprising a 
sequence as represented in any of SEQ ID Nos: 2, 5, 7, 9 andl 1 as well as any of the 
25 variations described herein. 

A "POSH-associated protein" or "POSH-AP" refers to a protein capable of 
interacting with and/or binding to a POSH polypeptide. Generally, the POSH-AP 
may interact directly or indirectly with the POSH polypeptide. Preferred POSH-APs 
include those provided in Table 7. Other preferred POSH-APs include those listed 
30 in Table 8. Examples of these and other POSH-APs are provided throughout. 
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The terms peptides, proteins and polypeptides are used interchangeably 

herein. 

The term "purified protein" refers to a preparation of a protein or proteins 
which are preferably isolated from, or otherwise substantially free of, other proteins 
5 normally associated with the protein(s) in a cell or cell lysate. The term 
"substantially free of other cellular proteins" (also referred to herein as "substantially 
free of other contaminating proteins") is defined as encompassing individual 
preparations of each of the component proteins comprising less than 20% (by dry 
weight) contaminating protein, and preferably comprises less than 5% contaminating 

10 protein. Functional forms of each of the component proteins can be prepared as 
purified preparations by using a cloned gene as described in the attached examples. 
By "purified", it is meant, when referring to component protein preparations used to 
generate a reconstituted protein mixture, that the indicated molecule is present in the 
substantial absence of other biological macromolecules, such as other proteins 

15 (particularly other proteins which may substantially mask, diminish, confuse or alter 
the characteristics of the component proteins either as purified preparations or in 
their function in the subject reconstituted mixture). The term "purified" as used 
herein preferably means at least 80% by dry weight, more preferably in the range of 
85% by weight, more preferably 95-99% by weight, and most preferably at least 

20 99.8% by weight, of biological macromolecules of the same type present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight 
of less than 5000, can be present). The term "pure" as used herein preferably has the 
same numerical limits as "purified" immediately above. 

A "recombinant nucleic acid*' is any nucleic acid that has been placed 

25 adjacent to another nucleic acid by recombinant DNA techniques. A "recombined 
nucleic acid" also includes any nucleic acid that has been placed next to a second 
nucleic acid by a laboratory genetic technique such as, for example, tranformation 
and integration, transposon hopping or viral insertion. In general, a recombined 
nucleic acid is not naturally located adjacent to the second nucleic acid. 

30 The term "recombinant protein" refers to a protein of the present application 

which is produced by recombinant DNA techniques, wherein generally DNA 
encoding the expressed protein is inserted into a suitable expression vector which is 
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in turn used to transform a host cell to produce the heterologous protein. Moreover, 
the phrase "derived from", with respect to a recombinant gene encoding the 
recombinant protein is meant to include within the meaning of "recombinant 
protein*' those proteins having an amino acid sequence of a native protein, or an 
5 amino acid sequence similar thereto which is generated by mutations including 
substitutions and deletions of a naturally occurring protein. 

A "RING domain" or tc Ring Finger" is a zinc-binding domain with a defined 
octet of cysteine and histidine residues. Certain RING domains comprise the 
consensus sequences as set forth below (amino acid nomenclature is as set forth in 
10 Table 1): Cys Xaa Xaa Cys Xaaio - 20 Cys Xaa His Xaa 2 . 5 Cys Xaa Xaa Cys Xaa l3 - 5 o 
Cys Xaa Xaa Cys or Cys Xaa Xaa Cys Xaaio - 20 Cys Xaa His Xaa 2 -5 His Xaa Xaa 
Cys Xaai3-5o Cys Xaa Xaa Cys. Certain RING domains are represented as amino 
acid sequences that are at least 80% identical to amino acids 12-52 of SEQ ID NO: 2 
and is set forth in SEQ ID No: 26. Preferred RING domains are 85%, 90%, 95%, 
15 98% and, most preferably, 100% identical to the amino acid sequence of SEQ ID 
NO: 26. Preferred RING domains of the application bind to various protein partners 
to form a complex that has ubiquitin ligase activity. RING domains preferably 
interact with at least one of the following protein types: F box proteins, E2 ubiquitin 
conjugating enzymes and cullins. 
20 The term "RNA interference" or "RNAi" refers to any method by which 

expression of a gene or gene product is decreased by introducing into a target cell 
one or more double-stranded RNAs which are homologous to the gene of interest 
(particularly to the messenger RNA of the gene of interest). RNAi may also be 
achieved by introduction of a DNA:RNA hybrid wherein the antisense strand 
25 (relative to the target) is RNA. Either strand may include one or more modifications 
to the base or sugar-phosphate backbone. Any nucleic acid preparation designed to 
achieve an RNA interference effect is referred to herein as an siRNA construct. 
Phosphorothioate is a particularly common modification to the backbone of an 
siRNA construct. 

30 "Small molecule" as used herein, is meant to refer to a composition, which 

has a molecular weight of less than about 5 kD and most preferably less than about 
2.5 kD. Small molecules can be nucleic acids, peptides, polypeptides, 
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peptidomimetics, carbohydrates, lipids or other organic (carbon containing) or 
inorganic molecules. Many pharmaceutical companies have extensive libraries of 
chemical and/or biological mixtures comprising arrays of small molecules, often 
fungal, bacterial, or algal extracts, which can be screened with any of the assays of 
5 the application. 

An 4< SH3" o r "Src H omology 3 " d omain i s a p rotein d omain o f g enerally 
about 60 amino acid r esidues first identified as a conserved sequence in the non- 
catalytic part of several cytoplasmic protein tyrosine kinases (e.g., Src, Abl, Lck). 
SH3 domains mediate assembly of specific protein complexes via binding to 

10 proline-rich peptides. Exemplary SH3 domains are represented by amino acids 137- 
192, 199-258, 448-505 and 832-888 of SEQ ID NO:2 and are set forth in SEQ ID 
Nos: 27-30. In certain embodiments, an SH3 domain interacts with a consensus 
sequence of RXaaXaaPXaaX6P (where X6, as defined in table 1 below, is a 
hydrophobic amino acid). In certain embodiments, an SH3 domain interacts with 

15 one or more of the following sequences: P(T/S)AP, PFRDY, RPEPTAP, 
RQGPKEP, RQGPKEPFR, RPEPTAPEE and RPLPVAP. 

As used herein, the term "specifically hybridizes" refers to the ability of a 
nucleic acid probe/primer of the application to hybridize to at least 12, 15, 20, 25, 
30, 35, 40, 45, 50 or 100 consecutive nucleotides of a POSH sequence, or a 

20 sequence complementary thereto, or naturally occurring mutants thereof, such that it 
has less than 15%, preferably less than 10%, and more preferably less than 5% 
background hybridization to a cellular nucleic acid (e.g., mRNA or genomic DNA) 
other than the POSH gene. A variety of hybridization conditions may be used to 
detect specific hybridization, and the stringency is determined primarily by the wash 

25 stage of the hybridization assay. Generally high temperatures and low salt 
concentrations give high stringency, while low temperatures and high salt 
concentrations give low stringency. Low stringency hybridization is achieved by 
washing in, for example, about 2.0 x SSC at 50 °C, and high stringency is acheived 
with about 0.2 x SSC at 50 °C. Further descriptions of stringency are provided 

30 below. 

As applied to polypeptides, "substantial sequence identity" means that two 
peptide sequences, when optimally aligned, such as by the programs GAP or 
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BESTFIT using default gap which share at least 90 percent sequence identity, 
preferably at least 95 percent sequence identity, more preferably at least 99 percent 
sequence identity or more. Preferably, residue positions which are not identical 
differ by conservative amino acid substitutions. For example, the substitution of 
5 amino acids having similar chemical properties such as charge or polarity are not 
likely to effect the properties of a protein. Examples include glutamine for 
asparagine or glutamic acid for aspartic acid. 

As is well known, genes for a particular polypeptide may exist in single or 
multiple copies within the genome of an individual. Such duplicate genes may be 
10 identical or may have certain modifications, including nucleotide substitutions, 
additions or deletions, which all still code for polypeptides having substantially the 
same activity. 

A "virion" is a complete viral particle; nucleic acid and capsid (and a lipid 
envelope in some viruses. A "viral particle" may be incomplete, as when produced 
15 by a cell transfected with a defective virus (e.g., an HIV virus-like particle system). 



Table 1: Abbreviations for classes of amino acids* 



Symbol 


Category 


Amino Acids 
Represented 


XI 


Alcohol 


Ser, Thr 


X2 


Aliphatic 


He, Leu, Val 


Xaa 


Any 


Ala, Cys, Asp, Glu, Phe, 
Gly, His, lie, Lys, Leu, 
Met, Asn, Pro, Gin, Arg, 
Ser, Thr, Val, Trp, Tyr 


X4 


Aromatic 


Phe, His, Trp, Tyr 


X5 


Charged 


Asp, Glu, His, Lys, Arg 
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X6 


Hydrophobic 


Ala, Cys, Phe, Gly, His, ! 
lie, Lys, Leu, Met, Thr, 
Val, Trp, Tyr 


X7 


Negative 


Asp, Glu 


X8 


Polar 


Cys, Asp, Glu, His, Lys, 
Asn, Gin, Arg, Ser, Thr 


X9 


Positive 


His, Lys, Arg 


X10 


Small 


Ala, Cys, Asp, Gly, Asn, 
Pro, Ser, Thr, Val 


XI 1 


Tiny 


Ala, Gly, Ser 


X12 


Turnlike 


Ala, Cys, Asp, Glu, Gly, 
His, Lys, Asn, Gin, Arg, 
Ser,Thr 


X13 


Asparagine-Aspartate 


Asn, Asp 



* Abbreviations as adopted from http://smart.embl- 
heidelberg.de/SMARTJDATA/ali 



2. Overview 

In certain aspects, the application relates to the discovery of novel 
5 associations between POSH proteins and other proteins (termed POSH-APs), and 
related methods and compositions. In certain aspects, the application relates to 
novel associations among certain disease states, POSH nucleic acids and proteins, 
and POSH-AP nucleic acids and proteins. 

In certain aspects, by identifying proteins associated with POSH, and 
10 particularly human POSH, the present application provides a conceptual link 
between the POSH-APs and cellular processes and disorders associated with POSH- 
APs, and POSH itself. Accordingly, in certain embodiments of the disclosure, 
agents that modulate a POSH-AP may now be used to modulate POSH functions 
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and disorders associated with POSH function, such as viral disorders, POSH- 
associated cancers, and POSH-associated neural disorders. Additionally, test agents 
may be screened for an effect on a POSH-AP and then further tested for an effect on 
a POSH function or a disorder associated with POSH function. Likewise, in certain 
5 embodiments of the disclosure, agents that modulate POSH may now be used to 
modulate POSH-AP functions and disorders associated with POSH-AP function, 
including a variety of cancers. Additionally, test agents may be screened for an 
effect on POSH and then further tested for effect on a POSH-AP function or a 
disorder associated with POSH-AP function. In further aspects, the application 
10 provides nucleic acid agents (e.g., RNAi probes, antisense nucleic acids), antibody- 
related agents, small molecules and other agents that affect POSH function, and the 
use of same in modulating POSH and/or POSH-AP activity. 

POSH intersects with and regulates a wide range of key cellular functions 
that may be manipulated by affecting the level of and/or activity of POSH 
15 polypeptides or POSH-AP polypeptides. Many features of POSH, and particularly 
human POSH, are described in PCT patent publications WO03/095971A2 
(application no. WO2002US0036366) and WO03/078601A2 (application no. 
WO2003US0008194) the teachings of which are incorporated by reference herein. 

As described in the above-referenced publications, native human POSH is a 
20 large polypeptide containing a RING domain and four SH3 domains. POSH is a 
ubiquitin ligase (also termed an "E3" enzyme); the RING domain mediates 
ubiquitination of, for example, the POSH polypeptide itself. POSH interacts with a 
large number of proteins and participates in a host of different biological processes. 
As demonstrated in this disclosure, POSH associates with a number of different 
25 proteins in the cell. POSH co-localizes with proteins that are known to be located in 
the trans-Golgi network, implying that POSH participates in the trafficking of 
proteins in the secretory system. The term "secretory system" should be understood 
as referring to the membrane compartments and associated proteins and other 
molecules that are involved in the movement of proteins from the site of translation 
30 to a location within a vacuole, a compartment in the secretory pathway itself, a 
lysosome or endosome or to a location at the plasma membrane or outside the cell. 
Commonly cited examples of compartments in the secretory system include the 



-26- 



WO 2004/078130 



PCT7US2004/006308 



endoplasmic reticulum, the Golgi apparatus and the cis and trans Golgi networks. In 
addition, Applicants have demonstrated that POSH is necessary for proper secretion, 
localization or processing of a variety of proteins, including phospholipase D, HTV 
Gag, HTV Nef, Rapsyn and Src. Many of these proteins are myristoylated, 
5 indicating that POSH plays a general role in the processing and proper localization 
of myristoylated proteins. N-myristoylation is an acylation process, which results in 
covalent attachment of myristate, a 14-carbon saturated fatty acid to the N-terminal 
glycine of proteins (Farazi et al., J. Biol. Chem. 276: 39501-04 (2001)). N- 
myristoylation occurs co-translationaly and promotes weak and reversible protein- 
10 membrane interaction. Myristoylated proteins are found both in the cytoplasm and 
associated with membrane. Membrane association is dependent on protein 
configuration, i.e., surface accessibility of the myristoyl group may be regulated by 
protein modifications, such as phosphorylation, ubiquitination etc. Modulation of 
intracellular transport of myristoylated proteins in the application includes effects on 
1 5 transport and localization of these modified proteins. 

As described herein, POSH and POSH-APs are involved in viral maturation, 
including the production, post-translational processing, assembly and/or release of 
proteins in a viral particle. Accordingly, viral infections may be ameliorated by 
inhibiting an activity (e.g., ubiquitin ligase activity or target protein interaction) of 
20 POSH or a POSH-AP (e.g., inhibition of kinase activity or ubiquitin ligase activity), 
and in preferred embodiments, the virus is a retroid virus, an RNA virus or an 
envelope virus, including HIV, Ebola, HBV, HCV, HTLV, West Nile Virus (WNV) 
or Moloney Murine Leukemia Virus (MMuLV). Additional viral species are 
described in greater detail below. In certain instances, a decrease of a POSH 
25 function is lethal to cells infected with a virus that employs POSH in release of viral 
particles. 

In certain aspects, the application describes an hPOSH interaction with Rac, 
a small GTPase and the POSH associated kinases MLK, MKK and JNK. Rho, Rac 
and Cdc42 operate together to regulate organization of the actin cytoskeleton and the 
30 MLK-MKK-JNK MAP kinase pathway (referred to herein as the "JNK pathway" or 
"Rac- JNK pathway" (Xu et al., 2003, EMBO J. 2: 252-61). Ectopic expression of 
mouse POSH ("mPOSH") activates the JNK pathway and causes nuclear 



-27- 



WO 2004/078130 



PCT/US2004/006308 



localization of NF-kB. Overexpression of mPOSH in fibroblasts stimulates 
apoptosis. (Tapon et al. (1998) EMBO J. 17:1395-404). In Drosophila, POSH may 
interact with, or otherwise influence the signaling of, another GTPase, Ras. 
(Schnorr et al. (2001) Genetics 159: 609-22). The JNK pathway and NF-kB 

5 regulate a variety of key genes involved in, for example, immune responses, 
inflammation, cell proliferation and apoptosis. For example, NF-kB regulates the 
production of interleukin 1, interleukin 8, tumor necrosis factor and many cell 
adhesion molecules. NF-kB has both pro-apoptotic and anti-apoptotic roles in the 
cell (e.g., in FAS-induced cell death and TNF-alpha signaling, respectively). NF-kB 

10 is negatively regulated, in part, by the inhibitor proteins IkBoc and JkBp (collectively 
termed "IkB"). Phosphorylation of IkB permits activation and nuclear localization 
of NF-kB. Phosphorylation of IkB triggers its degradation by the ubiquitin system. 
In an additional embodiment, a POSH polypeptide promotes nuclear localization of 
NF-kB. In further embodiments, manipulation of POSH levels and/or activities may 

15 be used to manipulate apoptosis. By upregulating POSH or a POSH-AP, apoptosis 
may be stimulated in certain cells, and this will generally be desirable in conditions 
characterized by excessive cell proliferation (e.g., in certain cancers). By 
downregulating POSH or a POSH-AP, apoptosis may be diminished in certain cells, 
and this will generally be desirable in conditions characterized by excessive cell 

20 death, such as myocardial infarction, stroke, degenerative diseases of muscle and 
nerve (particularly Alzheimer's disease), and for organ preservation prior to 
transplant. In a further embodiment, a POSH polypeptide associates with a vesicular 
trafficking complex, such as a clathrin- or coatomer- containing complex, and 
particularly a trafficking complex that localizes to the nucleus and/or Golgi 

25 apparatus. 

As described in WO03/078601A2 (application no. WO2003US0008194), 
POSH is overexpressed in a variety of cancers, and downregulation of POSH is 
associated with a decrease in proliferation in at least one cancer cell line. 
Accordingly, agents that modulate POSH itself or a POSH-AP may be used to treat 
30 POSH associated cancers. POSH associated cancers include those cancers in which 
POSH is overexpressed and/or in which downregulation of POSH leads to a 
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decrease in the proliferation or survival of cancer cells. POSH-associated cancers 
are described in more detail below. In addition, it is notable that many proteins 
shown herein to be affected by POSH downregulation are themselves involved in 
cancers. Phospholipase D and SRC are both aberrantly processed in a POSH- 
5 impaired cell, and therefore modulation of POSH and/or a POSH-AP may affect the 
wide range of cancers in which PLD and SRC play a significant role. 

As described in WO03/095971 A2 (application no. WO2002US0036366) and 
WO03/078601A2 (application no. WO2003US0008194), POSH polypeptides 
function as E3 enzymes in the ubiquitination system. Accordingly, downregulation 

10 or upregulation of POSH ubiquitin ligase activity can be used to manipulate 
biological processes that are affected by protein ubiquitination. Modulation of 
POSH ubiquitin ligase activity may be used to affect POSH-APs and related 
biological processes, and likewise, modulation of POSH-APs may be used to affect 
POSH ubiquitin ligase activity and related processes. Downregulation or 

15 upregulation may be achieved at any stage of POSH formation and regulation, 
including transcriptional, translational or post-translational regulation. For example, 
POSH transcript levels may be decreased by RNAi targeted at a POSH gene 
sequence. As another example, POSH ubiquitin ligase activity may be inhibited by 
contacting POSH with an antibody that binds to and interferes with a POSH RING 

20 domain or a domain of POSH that mediates interaction with a target protein (a 
protein that is ubiquitinated at least in part because of POSH activity). As a further 
example, small molecule inhibitors of POSH ubiquitin ligase activity are provided 
herein. As another example, POSH activity may be increased by causing increased 
expression of POSH or an active portion thereof. POSH, and POSH-APs that 

25 modulate POSH ubiquitin ligase activity may participate in biological processes 
including, for example, one or more of the various stages of a viral lifecycle, such as 
viral entry into a cell, production of viral proteins, assembly of viral proteins and 
release of viral particles from the cell. POSH may participate in diseases 
characterized by the accumulation of ubiquitinated proteins, such as dementias (e.g., 

30 Alzheimer's and Pick's), inclusion body myositis and myopathies, polyglucosan 
body myopathy, and certain forms of amyotrophic lateral sclerosis. POSH may 
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participate in diseases characterized by excessive or inappropriate ubiquitination 

and/or protein degradation. 

3. POSH Associated Proteins 

In certain aspects, the application relates to the discovery of novel 
5 associations between POSH proteins and other proteins (termed POSH-APs), and 
related methods and compositions. In certain aspects, the application relates to 
novel associations among certain disease states, POSH nucleic acids and proteins, 
and POSH-AP nucleic acids and proteins. POSH-APs may interact either directly or 
indirectly with POSH. In certain embodiments, a POSH-AP binds directly to a 

1 0 POSH polypeptide. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with one subunit of Protein Kinase A (PKA; cAMP-dependent 
protein kinase). In one aspect, the application relates to the discovery that POSH 
binds directly with PRKAR1 A. This interaction was identified by Applicants in a 

15 yeast 2-hybrid assay. Exemplary PKA subunits may include, but are not limited to, 
a regulatory subunit (e.g., PRKAR1A) and a catalytic subunit (e.g., PRKACA or 
PRKACB). PKA is an essential enzyme in the signaling pathway of the second 
messenger cyclic AMP (cAMP). Through phosphorylation of target proteins, PKA 
controls many biochemical events in the cell including regulation of metabolism, ion 

20 transport, and gene transcription. The PICA holoenzyme is composed of two 
regulatory and two catalytic subunits and dissociates from the regulatory subunits 
upon binding of cAMP. The PKA enzyme is inactive in the absence of cAMP. 
Activation of PKA occurs when two cAMP molecules bind to each regulatory 
subunit, eliciting a reversible conformational change that releases active catalytic 

25 subunits. 

A number of human PKA subunits have been characterized, including a 
regulatory subunit (type I alpha: PRKAR1) and two catalytic subunits (C-alpha: 
PRKACA; and C-beta: PRKACB). Boshart et al. identified the regulatory subunit 
PRKAR1 of PKA as the product of the TSE1 locus (Boshart, M et al. (1991) Cell 
30 66: 849-859). The evidence consisted of concordant expression of PRKAR1 mRNA 
and TSE1 genetic activity, high resolution physical mapping of the two genes on 
human chromosome 17, and the ability of transfected PRKAR1 cDNA to generate a 
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phenocopy of TSE1 -mediated extinction. Jones et al. independently established 
identity of TSE1 and the Rl-alpha subunit (Jones, KW et al. (1991) Cell 66: 861- 
872). 

Other than a role of PKA in metabolism, PKA subunits have recently been 
5 implicated in multiple diseases. For example, a specific role for localized PRKAR1 
has been demonstrated in human T lymphocytes, where type I PKA localizes to the 
activated TCR complex and is required for attenuation of signals propagated through 
this complex (Skalhegg, BS et al. (1992) J Biol Chem 267:15707-15714; Skalhegg, 
BS et al. (1994) Science 263: 84-87). The importance of type I PKA-mediated 

10 effects in attenuation of T cell replication has led to its consideration as a therapeutic 
target in combined variable immunodeficiency (CVT) and acquired immune 
deficiency syndrome (AIDS). Furthermore, type I PKA in T cells may also serve as 
a potential therapeutic target in systemic lupus erythematosis (SLE). For example, a 
series of recently published articles has uncovered the first human disease mapping 

15 to a PKA subunit-Carney complex (Casey, M et al. (2000) J Clin Invest 106: R31- 
38; Kirschner, LS et al. (2000) Nat Genet 26: 89-92). Carney complex (CNC) is a 
multiple neoplasia syndrome characterized by spotty skin pigmentation, cardiac and 
skin myxomas, endocrine tumors, and psammomatous melanotic schwannomas. 
CNC maps to two genomic loci, 17q24 and 2pl6. Familial cases mapping to the 

20 17q24 locus reveal deletions/mutations in the PRKAR1 coding exons leading to 
frameshifts and premature stop codons — no mRNA and protein from the mutant 
alleles has been observed. 

Accordingly, in certain aspects of the present disclosure, POSH participates 
in the formation of PKA complexes, including human PKA-containing complexes. 

25 Certain P OSH p olypeptides m ay b e i nvolved i n d isorders o f t he i mmune s ystem, 
e.g., autoimmune disorders. Certain POSH polypeptides may be involved in the 
regulation of T-cell activation. In certain aspects, POSH participates in the 
ubiquitination of PI3K. In certain aspects, PKA subunit polypeptides participate in 
POSH-rnediated processes. 

30 Additionally, the disclosure relates in part to the discovery that PKA 

phosphorylates POSH, and further, that this phosphorylation inhibits the interaction 
of POSH with small GTPases, such as Rac. Small GTPases are important in 
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vesicular trafficking, and therefore the findings disclosed herein demonstrate that 
POSH phosphorylation regulates the formation of complexes between POSH and 
proteins involved in the secretory system, such as Rac, TCL, TC10, Cdc42, Wrch-1, 
Rac2, Rac3 or RhoG. Applicants have shown that inhibition of PKA and POSH has 
5 similar effects, indicating that inhibition of PKA will achieve an effect similar to 
that of inhibition of POSH. However, given the effect of PKA on POSH interaction 
with proteins in the secretory pathway, it is expected that PKA regulates the timing 
of cyclical interactions that are needed to effect vesicular trafficking. Accordingly, 
it is expected that significant inhibition or activation of PKA will cause a disruption 

10 in POSH function. 

The term "PKA subunit" is used herein to refer to a full-length human PKA 
subunit which includes a regulatory subunit (e.g., PRKAR1A) and a catalytic 
subunit (e.g., PRKACB or PRKACA), as well as an alternative PKA subunit 
composed of separate PKA subunit sequences (e.g., nucleic acid sequences) that 
15 may be a splice variant. The term "PKA subunit" is used herein to refer as well to 
various naturally occurring PKA subunit homologs, as well as functionally similar 
variants and fragments that retain at least 80%, 90%, 95%, or 99% sequence identity 
to a naturally occurring PKA subunit (e.g., SEQ ID NOs: 264-265, 111-122, 395- 
396). The term specifically includes human PKA subunit nucleic acid and amino 
20 acid sequences and the sequences presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with human UNC84B, a human homolog of C. elegans Unc- 
84. Accordingly, the application provides complexes comprising POSH and 
UNC84B. In one aspect, the application relates to the discovery that POSH binds 
25 directly with TJNC84B. This interaction was identified by Applicants in a yeast 2- 
hybrid assay. In C. elegans, Unc-84 is involved in the cellular positioning of the 
nucleus. UNC84/SUN is positioned at the nuclear membrane and recruits 
Syne/ANC-1, which directly tethers the nuclear envelope to the actin cytoskeleton. 
Accordingly, in certain aspects, POSH participates in formation of a UNC84 
30 complexes, including human UNC84B-containing complexes, and in the 
connections between the nucleus and the cytoskeleton. In certain aspects, UNC84 
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polypeptides participate in POSH-mediated processes. See, for example, Starr and 
Han, 2003, J Cell Sci 116(Pt 2):211-6. 

The term UNC84 is used herein to refer to various naturally occurring Unc- 
84 homologs, as well as functionally similar variants and fragments that retain at 
5 least 8 0%, 9 0%, 9 5%, o r 9 9% se quence i dentity t o a n aturally o ccurring U NC84 
(e.g., SEQ ID NOs: 314, 211-213 ). The term specifically includes human UNC84B 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with human GOCAP1. Certain GOCAP1 polypeptides are 

10 cytoplasmic proteins associated with the Golgi complex. Accordingly, the 
application provides complexes comprising POSH and GOCAP1. In one aspect, the 
application relates to the discovery that POSH binds directly with GOCAP1. This 
interaction was identified by Applicants in a yeast 2-hybrid assay. In certain 
aspects, these complexes associate with the Golgi complex. GOCAP1 is 

15 synonymous with GCP60. Certain GCP60 polypeptides interact with the Golgi 
complex integral membrane protein, giantin. Certain GCP60 polypeptides are 
involved in the maintenance of the Golgi structure through interaction with giantin 
and affect protein transport between the endoplasmic reticulum and the Golgi 
complex (Sohda, M, et al. (2001) J Biol Chem 276:45298-306). In certain aspects, 

20 GOCAP1 polypeptides participate in POSH-mediated processes. 

The term GOCAP1 is used herein to refer to various naturally occurring 
GOCAP1 homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
GOCAP1 (e.g., SEQ ID NOs: 240-243, 61-68). The term specifically includes 

25 human GOCAP1 nucleic acid and amino acid sequences and the sequences 
presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with human PTPN12, a protein tyrosine phosphatase. 
Accordingly, the application provides complexes comprising P OSH and PTPN12. 

30 In one aspect, the application relates to the discovery that POSH binds directly with 
PTPN12. This interaction was identified by Applicants in a yeast 2-hybrid assay. 
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PTPN12 polypeptides are synonymous with the protein tyrosine phosphatase, PTP- 
PEST. PTP-PEST polypeptides contain proline-rich sequences and are rich in 
proline, ghitamate, serine, and threonine residues at their carboxyl terminus, features 
characteristic of PEST motifs. Certain PTP-PEST polypeptides interact with 
5 paxillin, a scaffolding protein to which focal adhesion proteins bind, leading to the 
formation of the focal adhesion contact (Shen, Y et al. (1998) J Biol Chem 
273:6474-81). Certain PTP-PEST polypeptides associate with the focal adhesion 
protein, pl30cas (Garton, AJ et al. (1997) Oncogene 15:877-85). Certain PTP- 
PEST polypeptides have also been shown to associate with JAK2, PSTPIP and 
10 WASP, gelsolin, cell adhesion kinase beta, Csk, Hef 1 or Sin , Hic-5, or She (See, 
for example, Horsch, et al (2001) Mol Endocrinol 15:2182-96; Cote, et al (2002) J 
Biol Chem 277:2973-86; Chellaiah, et al (2001) J Biol Chem 276:47434-44; Lyons, 
et al (2001) J Biol Chem 276:24422-31; Davidson, et al (1997) J Biol Chem 
21:1077-88; Cote, JF et al (1998) Biochemistry 37:13128-37; Nishiya, N (1999) J 
15 Biol Chem 274:9847-53; Habib, T et al (1994) J Biol Chem 269:25243-6). Certain 
PTP-PEST polypeptides are involved in inactivation of the Ras pathway (Davidson, 
D and Veillette, A (2001) EMBO J 20:3414-26). The expression level of certain 
PTP-PEST polypeptides can modulate the activity of the GTPase, Racl (Sastry, et al 
(2002) J Cell Sci 115(Pt 22): 4305-16). Certain PTP-PEST polypeptides are 
20 involved in the regulation of cell motility (Garton, AJ and Tonks, NK (1999) J Biol 
Chem 274:3811-8; Angers-Loustau, et al (1999) J Cell Biol 144:1019-31; and 
Sastry, et al. (2002) J Cell Sci 155(Pt 22): 4305-16). Accordingly, certain POSH 
polypeptides are involved in inactivation of the Ras pathway. Certain POSH 
polypeptides are involved in the regulation of cell motility. 
25 Certain PTP-PEST polypeptides are involved in amyloid(3-induced neuronal 

dystrophy, a pathological hallmark of Alzheimer's disease (Grace, EA and 
Busciglio, J (2003) J Neurosci. 23:493-502). Accordingly, certain POSH 
polypeptides may be involved in Alzheimer's disease. Certain PTP-PEST 
polypeptides function as negative regulators of lymphocyte activation (Davidson, D 
30 and Veillette, A (2001) EMBO J 20:3414-26). Accordingly, certain POSH 
polypeptides may be involved in the regulation of lymphocyte activation. In certain 
aspects, PTPN12 polypeptides participate in POSH-mediated processes. 



-34- 



WO 2004/078130 



PCT/US2004/006308 



The term PTPN12 is used herein to refer to various naturally occurring 
PTPN12 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring PTPN12 
(e.g., SEQ ID NOs: 266-268, 123-129). The term specifically includes human 
5 PTPN12 nucleic acid and amino acid sequences and the sequences presented in 
Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with HERPUD1, a 'Tiomocysteine-inducible, endoplasmic 
reticulum stress-inducible, ubiquitin-like domain member 1" protein. Accordingly, 

10 the application provides complexes comprising POSH and HERPUD1. In one 
aspect, the application relates to the discovery that POSH binds directly with 
HERPUD1 . This interaction was identified by Applicants in a yeast 2-hybrid assay. 
HERPUD1 is synonymous with Herp. In part, the present application relates to the 
discovery that a POSH-AP, HERPUD1, is involved in the maturation of an envelope 

15 virus, such as HIV. 

Certain HERPUD1 polypeptides are involved in JNK-mediated apoptosis, 
particularly in vascular endothelial cells, including cells that are exposed to high 
levels of homocysteine. Certain HERPUD1 polypeptides are involved in the 
Unfolded Protein Response, a cellular response to the presence of unfolded proteins 
20 in the endoplasmic reticulum. Certain HERPUD1 polypeptides are involved in the 
regulation of sterol biosynthesis. Accordingly, certain POSH polypeptides are 
involved in the Unfolded Protein Response and sterol biosynthesis. 

In other aspects, certain HERPUD1 polypeptides enhance presenilin- 
mediated amyloid p-protein generation. For example, HERPUD1 polypeptides, 

25 when overexpressed in cells, increase the level of amyloid P generation, and it is 
observed that HERPUD1 polypeptides interact with the presenilin proteins, 
presenilin- 1 and presenilin-2. (See Sai, X. et al (2002) J. Biol. Chem. 277:12915- 
12920). Accordingly, in certain aspects, POSH polypeptides may modulate the level 
of amyloid (3 generation. Additionally, POSH polypeptides may interact with 

30 presenilin 1 and presenilin 2. Therefore, it is believed certain POSH polypeptides 
modulate nresenilin-mediated amyloid p generation. The accumulation of amyloid 
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P is one hallmark of Alzheimer's disease. Accordingly, these POSH polypeptides 
may be involved in the pathogenesis of Alzheimer's disease. At sites such as late 
intracellular compartment sites including the trans-Golgi network, certain mutant 
presenilin-2 polypeptides up-regulate production of amyloid (J peptides ending at 
5 position 42 (A042). (See Iwata, H. et al (2001) J. Biol. Chem. 276: 21678-21685). 
Accordingly, POSH polypeptides regulate production of Ap42 through mutant 
presenilin-2 at late intracellular compartment sites including the trans-Golgi 
network. Furthermore, elevated homocysteine levels have been found to be a risk 
factor associated with Alzheimer's disease and cerebral vascular disease. Some risk 

10 factors, such as elevated plasma homocysteine levels, may accelerate or increase the 
severity of several central nervous system (CNS) disorders. Elevated levels of 
plasma homocysteine were found in young male patients with schizophrenia 
suggesting that elevated homocysteine levels could be related to the 
pathophysiology of aspects of schizophrenia (Levine, J. et al (2002) Am. J. 

15 Psychiatry 159:1790-2). Accordingly, certain POSH polypeptides may be involved 
in neurological disorders. Neurological disorders include disorders associated with 
increased levels of plasma homocysteine, increased levels of amyloid P production, 
or aberrant presenilin acitivity. Neurological disorders include CNS disorders, such 
as Alzheimer's disease, cerebral vascular disease and schizophrenia. Certain POSH 

20 polypeptides may be involved in cardiovascular diseases, such as thromboembolic 
vascular disease, and particularly the disease characteristics associated with 
hyperhomocysteinemia. See, for example, Kokame et al. 2000 J. Biol. Chem. 
275:32846-53; Zhang et al. 2001 Biochem Biophys Res Commun 289:718-24. 

The t erm H ERPUD1 i s used h erein tor efer t o v arious n aturally o ccurring 
25 HERPUD1 homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
HERPUD1 (e.g., SEQ ID NOs: 249-252, 77-86). The term specifically includes 
human HERPUD1 nucleic acid and amino acid sequences and the sequences 
presented in Figure 36. 



30 In certain aspects, the application relates to the discovery that a POSH 

polypeptide interacts with one or more Cbl-b polypeptides. Accordingly, the 
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application provides complexes comprising POSH and Cbl-b. In one aspect, the 
application relates to the discovery that POSH binds directly with Cbl-b. This 
interaction was identified by Applicants in a yeast 2-hybrid assay. Cbl-b 
polypeptides contain an amino-terminal variant SH2 domain, a RING finger, and a 
5 carboxyl-terminal proline-rich domain with potential tyrosine phosphorylation sites. 
Cbl-b is highly homologous to the mammalian Cbl and the nematode Sli-1 proteins. 
This application provides four Cbl-b variants and shows that the POSH polypeptide 
interacts with one or more of these variants. In one aspect, the POSH polypeptide 
interacts with a human Cbl-b (UniGene No.: Hs.3144). In another aspect, the POSH 

10 polypeptide i nteracts w ith an alternative human Cbl-b (UniGene No.: Hs.381921) 
that may be a splice variant of Cbl-b. In yet another aspect, the POSH polypeptide 
interacts with a human Cbl-b polypeptide that is a splice variant represented by the 
amino acid sequence depicted in SEQ ID NO: 361, which is encoded by the nucleic 
acid sequence depicted in SEQ ID NO: 359. In yet another aspect, the POSH 

15 polypeptide interacts with a human Cbl-b polypeptide that is a splice variant 
represented by the amino acid sequence depicted in SEQ ID NO: 398, which is 
encoded by the nucleic acid sequence depicted in SEQ ID NO: 360. 

Certain Cbl-b polypeptides have been shown to function as adaptor proteins 
by interacting with other signaling molecules, e.g., interaction with cell surface 

20 receptor tyrosine kinases, e.g., EGFR (Ettenberg, SA et al (2001) J Biol Chem 
276:77-84) or with proteins such as Syk (Elly, C et al (1999) Oncogene 18:1147- 
56), Crk-L (Elly, C et al (1999) Oncogene 18:1147-56), PI3K (Fang, D et al. (2001) 
J Biol Chem 16:4872-8), Grb2 (Ettenberg, SA et al (1999) Oncogene 18:1855-66), 
or Vav (Bustelo, XR et al. (1997) Oncogene 15:2511-20). Certain Cbl-b 

25 polypeptides have been demonstrated to interact directly with the nucleotide 
exchange factor, V av (Bustelo, XR et al. ( 1997) Oncogene 1 5:251 1-20). C ertain 
Cbl-b polypeptides have been shown to function as an E3 ubiquitin ligase that 
recognizes tyrosine phosphorylated substrates through its SH2 domain and through 
its RING domain, recruits a ubiquitin-conjugating enzyme, E2 (Joazeiro, C et al. 

30 (1999) S cience 2 86:309-312) A dditionally, certain Cbl-b p olypeptides have b een 
shown to associate directly with the p85 subunit of PI3K and to function as an E3 
ligase in the ubiquitination of PI3K (Fang, D et al. (2001) J Biol Chem 16:4872-8). 
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Certain Cbl-b polypeptides are negative regulators of T-cell activation. Cbl- 
b-deficient mice become very susceptible to experimental autoimmune 
encephalomyelitis (Chiang, YJ et al. (2000) Nature 403:216-220). Also, Cbl-b- 
deficient mice develop spontaneous autoimmunity (Bachmaier, K, et al (2000) 
5 Nature 403:211-216). Furthermore, Cbl-b is a major susceptibility gene for rat type 
1 diabetes mellitus (Yokoi, N et al (2002) Nature Genet. 31:391-394). 

Accordingly, in certain aspects, POSH participates in the formation of Cbl-b 
complexes, including human Cbl-b-containing complexes. Certain POSH 
polypeptides may be involved in disorders of the immune system, e.g., autoimmune 
10 disorders. Certain POSH polypeptides may be involved in the regulation of T-cell 
activation. In certain aspects, POSH participates in the ubiquitination of PI3K. In 
certain aspects, Cbl-b polypeptides participate in POSH-mediated processes. 

The term Cbl-b is used herein to refer to full-length, human Cbl-b (UniGene 
No.: Hs.3144) as well as an alternative Cbl-b (UniGene No.: Hs.381921) composed 
15 of two separate Cbl-b sequences (e.g., nucleic acid sequences) that may be a splice 
variant. The term Cbl-b is used herein to refer as well to the human Cbl-b splice 
variant represented by the amino acid sequence of SEQ ID NO: 361, which is 
encoded by the nucleic acid sequence of SEQ ID NO: 359 and to the human Cbl-b 
splice variant represented by the amino acid sequence of SEQ ID NO: 398, which is 
20 encoded by the nucleic acid sequence of SEQ ED NO: 360. The term Cbl-b is used 
herein to refer as well to various naturally occurring Cbl-b homologs, as well as 
functionally similar variants and fragments that retain at least 80%, 90%, 95%, or 
99% sequence identity to a naturally occurring Cbl-b (e.g., SEQ ID NOs: 361, 398, 
227-230, 353-360 ). The term specifically includes human Cbl-b nucleic acid and 
25 amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with GOSR2. Accordingly, the application provides 
complexes comprising POSH and GOSR2. In one aspect, the application relates to 
the discovery that POSH binds directly with GOSR2. This interaction was 
30 identified by Applicants in a yeast 2-hybrid assay. Certain GOSR2 polypeptides are 
synonymous with GS27 (for Golgi SNARE of 27K) and are involved in trafficking 
membrane proteins between the endoplasmic reticulum and the Golgi and between 
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Golgi subcompartments such as between the cis-^nedial- and trans-Golgi network. 
(See, for example, Lowe, SL et al (1997) Nature 389:881-4 and Bui, ID et al (1999) 
57:285-8). Accordingly, certain POSH polypeptides are involved in the trafficking 
of membrane proteins between the endoplasmic reticulum and the Golgi and 
5 between Golgi subcompartments. 

The term GOSR2 is used herein to refer to various naturally occurring 
GOSR2 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring GOSR2 
(e.g., SEQ ID NOs: 244-248, 69-76). The term specifically includes human GOSR2 
10 nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with RALA. Accordingly, the application provides complexes 
comprising POSH and RALA. In one aspect, the application relates to the discovery 
that POSH binds directly with RALA. This interaction was identified by Applicants 
15 in a yeast 2-hybrid assay. RALA polypeptides are GTP-binding polypeptides. 
RALA polypeptides are members of the Ras family of proteins and are GTPases. 
Certain RALA polypeptides may be synonymous with RalA polypeptides. RalA 
polypeptides are small GTPases. RalA polypeptides have been shown to interact 
with phospholipase D and to effect phospholipase D activity. Additionally, RalA 
20 polypeptides may be involved in tumor formation and cell transformation. (See, for 
example, Kim, JH et al (1998) FEBS Lett 430:231-5; Aguirre-Ghiso, JA et al (1999) 
Oncogene 18:4718-25; Lu, Z et al (2000) Mol Cell Biol 20:462-7; Gildea, JJ et al 
(2002) Cancer Res 62:982-5; Lucas, L et al (2002) Int J Oncol 21:477-85; and Xu, L 
et al (2003) Mol Cell Biol 23:645-54). Accordingly, certain POSH polypeptides 
25 may i nteract w ith P LD and m odulate i ts activity, and certain P OSH p olypeptides 
may be involved in tumor formation and cell transformation. In other aspects, 
certain RalA polypeptides interact with calmodulin and may be involved in 
calcium/calmodulin-mediated intracellular signaling pathways (Clough, RR et al 
(2002) J Biol Chem 277:28972-80). Certain RalA polypeptides are involved in 
30 controlling actin cytoskeletal remodeling and vesicle transport in mammalian cells. 
Certain RalA polypeptides interact with the exocyst complex, which is involved in 
exocytosis. (See, for example, Sugihara, K et al (2002) Nat Cell Biol 4:73-8; Polzin, 
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A et al (2002) Mol Cell Biol 22:1714-22; and Lipschutz, JH and Mostov, KE (2002) 
Curr Biol 1 2(6) :R2 12-4). Accordingly, certain POSH polypeptides are involved in 
vesicle transport. 

The term RALA is used herein to refer to various naturally occurring RALA 
5 homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring RALA (e.g., 
SEQ ID NOs: 269-270, 130-134). The term specifically includes human RALA 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
10 polypeptide interacts with SMN1. Accordingly, the application provides complexes 
comprising POSH and SMN1. In one aspect, the application relates to the discovery 
that POSH binds directly with SMN1 . This interaction was identified by Applicants 
in a yeast 2-hybrid assay. SMN1 polypeptides are encoded by the nucleic acid of 
the survival motor neuron gene 1 (SMN1). Mutations in this gene (such as its 
* 15 homozygous absence) cause spinal muscular atrophy (SMA), a common autosomal 
recessive disorder characterized by degeneration of motor neurons in the spinal cord, 
leading to progressive paralysis with muscular atrophy. Accordingly, POSH may be 
involved in the pathogenesis of SMA. SMN1 is part of a multiprotein complex that 
is required for biogenesis of the Sm class of small nuclear ribonucleoproteins (Sm 
20 snRNPs). SMN1 associates with a number of proteins, such as Gemin2 to Gemin6, 
to form a large complex found in both the cytoplasm and in the nucleus. SMN1 also 
associates with Snurportin 1, an adaptor protein that recognizes the nuclear 
localization signal of Sm snRNPs. (See, for example, Lefebvre, S et al (1995) Cell 
80:155-65; Narayanan, U et al (2002) Hum Mol Genet 1 1 :1785-95; Massenet, S et al 
25 (2002) 22:6533-41; and Monani, UR et al (1999) Hum Mol Genet 8:1177-83). 
Accordingly, certain POSH polypeptides may be involved in the biogenesis of 
snRNPs. Certain SMN1 polypeptides interact with the large nonstructural protein 
NS1 of the autonomous parvovirus minute virus of mice (MVM). NS1 is essential 
for viral replication, and it is a potent transcriptional activator (Young, PJ et al 
30 (2002) J Virol 76:3892-904). Certain SMN1 polypeptides interact with the protein 
NS2 of MVM. NS2 is also required for efficient viral replication. Certain SMN1 
polypeptides colocalize with NS2 in infected nuclei and at late times following 



-40- 



WO 2004/078130 



PCT/US2004/006308 



MVM infection. (See Young, PJ et al (2002) J Virol 76:6364-9). Accordingly, 
POSH polypeptides are involved in viral replication. 

The term SMN1 is used herein to refer to various naturally occurring SMN1 
homologs, as well as functionally similar variants and fragments that retain at least 
5 80%, 90%, 95%, or 99% sequence identity to a naturally occurring SMN1 (e.g., 
SEQ ID NOs: 273-275, 142-146). The term specifically includes human SMN1 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with SMN2. Accordingly, the application provides complexes 
10 comprising POSH and SMN2. In one aspect, the application relates to the discovery 
that POSH binds directly with SMN2. This interaction was identified by Applicants 
in a yeast 2-hybrid assay. The SMN2 gene is an almost identical copy of the SMN1 
gene that causes SMA. A critical difference between the two genes is a 1 nucleotide 
base change inside exon 7 that affects the splicing pattern of the genes. The 
15 majority of the SMN2 transcript lacks exon 7. Certain SMN2 polypeptides 
influence the severity of SMA. (See, for example, Monani, UR et al (1999) Hum 
Mol Genet 8: 1177-83; Cartegni, L and Krainer, AR (2002) Nat Genet 30:377-84; 
and Feldkotter, M et al (2002) Am J Hum Genet 70: 358-68). Accordingly, certain 
POSH polypeptides may influence the severity of SMA. 
20 The term SMN2 is used herein to refer to various naturally occurring SMN2 

homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SMN2 (e.g., 
SEQ ID NOs: 276-280, 147-151). The term specifically includes human SMN2 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

25 In certain aspects, the application relates to the discovery that a POSH 

polypeptide interacts with SIAH1. Accordingly, the application provides complexes 
comprising POSH and SIAH1. In one aspect, the application relates to the discovery 
that POSH binds directly with SIAH1. This interaction was identified by Applicants 
in a yeast 2-hybrid assay. Certain SIAH1 polypeptides bind ubiquitin-conjugating 
30 enzymes and target proteins for proteasome-mediated degradation. Certain SIAH1 
polypeptides are involved in targeting beta-catenin for degradation (Matsuzawa, S 
JC (2001) Molec Cell 7: 915-926 and Liu, J et al (2001) Molec Cell 7: 
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927-936). Accordingly, certain POSH polypeptides are involved in the targeting of 
beta-catenin for degradation. Certain SIAH1 polypeptides are E3 ubiquitin ligases 
and regulate the ubiquitination and degradation of synaptophysin (Wheeler, TC et al. 
(2002) J Biol Chem 277: 10273-92). Accordingly, certain POSH polypeptides are 
5 involved in the ubiquitination and degardation of synaptophysin. Certain SIAH1 
polypeptides regulate the protein, DCC (deleted in colorectal cancer), via the 
ubiquitin-proteosome pathway (Hu, G et al. (1997) Genes Dev 11: 2701-14). 
Accordingly, certain POSH polypeptides are involved in the ubiquitination and 
degardation of DCC. Certain SIAH1 polypeptides are a target of activation of p53 
and are upregulated by p53, and certain SIAH1 polypeptides are involved in 
apoptosis, tumor suppression, as well as vertebrate development (Maeda, A et al 
(2002) FEBS Lett 512: 223-226; Hu, G et al (1997) Genomics 46:103-111; and 
Nemani, M e t al ( 1 996) P roc N atl A cad S ci U S A 9 3 : 9 039-9042). Accordingly, 
certain POSH polypeptides may be a target of p53 activation, and certain POSH 
polypeptides may be involved in apoptosis and tumor suppression. 

The term SIAH1 is used herein to refer to various naturally occurring SIAH1 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SIAH1 (e.g., 
SEQ ID NOs: 271-272, 135-141). The term specifically includes human SIAH1 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with SYNE1. Accordingly, the application provides 
complexes comprising POSH and SYNE1. In one aspect, the application relates to 
the discovery that POSH binds directly with SYNE1 . This interaction was identified 
by Applicants in a yeast 2-hybrid assay. SYNE1 polypeptides are synonymous with 
Syne-1, myne-1, and nesprin-1 polypeptides. Syne-1 polypeptides are associated 
with nuclear envelopes in skeletal, cardiac, and smooth muscle cells. Syne-1 
polypeptides contain multiple spectrin repeats. In muscle, myne-1 expression is 
observed in the inner nuclear envelope, and myne-1 has been shown to interact with 
the inner nuclear membrane protein lamin A/C. Syne-1 also associates with the 
nuclear envelope protein, emerin. Syne-1 polypeptides may be involved in 
maintaining nuclear organization and structural integrity, and certain Syne-1 
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polypeptides may be involved in the migration of myonuclei in myotubes and/or 
their anchoring at the postsynaptic apparatus. (See, for example, Apel et al (2000) J 
Biol Chem 275:31986-95; Zhang, Q et al (2001) J Cell Sci 114:4485-98; Zhang, Q 
et al (2002) Genomics 80:473-81; and Mislow, JM et al (2002) J Cell Sci 115 (Pt 

5 l):61-70). Accordingly, certain POSH polypeptides may interact with the lamin 
A/C polypeptides and/or emerin polypeptides. Also, certain POSH polypeptides 
may be involved in mamtaining nuclear organization and structural integrity, and 
certain POSH polypeptides may be involved in the migration of myonuclei in 
myotubes and/or their anchoring at the postsynaptic apparatus. 

l0 The term SYNE1 is used herein to refer to various naturally occurring 

SYNE1 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring SYNE1 
(e.g., SEQ ID NOs: 295-307, 183-201). The term specifically includes human 
SYNE1 nucleic acid and amino acid sequences and the sequences presented in 

15 Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with TTC3. Accordingly, the application provides complexes 
comprising POSH and TTC3. In one aspect, the application relates to the discovery 
that POSH binds directly with TTC3 This interaction was identified by Applicants 
20 in a yeast 2-hybrid assay. Certain TTC3 polypeptides are synonymous with the 
proteins, TPRDI, TPRDII, TRPDIH, TPRD and DCRR1 and may be involved in the 
pathogenesis o f certain characteristics o f D own syndrome, s uch a s m orphological 
features, hypotonia, and mental retardation (Tsukahar, F et al (1996) J Biochem 
(Tokyo) 120: 820-827; Ohira, M et al (1996) DNA Res 3: 9-16; Dahmane, N et al 
25 (1998) Genomics 48: 12-23; and Eki, T et al (1997) DNA Seq 7:153-164). 

The term TTC3 is used herein to refer to various naturally occurring TTC3 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring TTC3 (e.g., SEQ 
ID NOs: 308-312, 202-207). The term specifically includes human TTC3 nucleic 
30 acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
nolvnentide interacts with VCY2IP1. Accordingly, the application provides 
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complexes comprising POSH and VCY2IP1. In one aspect, the application relates 
to the discovery that POSH binds directly with VCY2IP1. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. VCY2IP1 is synonymous with 
VCY2IP-1, which has been shown to interact with the testis-specific protein, VCY2. 
5 VCY2IP1 is also synonymous with C19orf5, which has been shown to interact with 
the tumor suppressor, RASSF1, suggesting a role for C19orf5 in apoptosis and 
tumor suppression (In Vitro Cell Dev Biol Anim (2002) 38:582-94). C19orf5 also 
demonstrates a strong homology to microtubule-associated proteins (Genomics 
(2002) 79:124-6). Accordingly, POSH may play a role in apoptosis and tumor 
10 suppression. 

The term VCY2IP1 is used herein to refer to various naturally occurring 
VCY2IP1 homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
VCY2EP1 (e.g., SEQ ID NOs: 315-323, 214-222). The term specifically includes 

15 human VCY2IP1 nucleic acid and amino acid sequences and the sequences 
presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with MSTP028. hi one aspect, the application relates to the 
discovery that POSH binds directly with MSTP028. This interaction was identified 

20 by Applicants in a yeast 2-hybrid assay. In part, the present application relates to the 
discovery that a POSH-AP, MSTP028, is involved in the maturation of an envelope 
virus, such as HIV. Certain MSTP028 polypeptides contain one or more BTB/POZ 
domains that are generally involved in dimerization. Accordingly the application 
provides complexes comprising POSH and MSTP028, optionally in a dimeric form. 

25 The term MSTP028 is used herein to refer to various naturally occurring MSTP028 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring MSTP028 (e.g., 
SEQ ID NOs: 255-256, 90-94). The term specifically includes human MSTP028 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

30 In certain embodiments, the application relates to the discovery that a POSH 

polypeptide interacts with SNX1. Accordingly, the application provides complexes 
comprising POSH and SNX1 . In one aspect, the application relates to the discovery 
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that POSH binds directly with SNX1. This interaction was identified by Applicants 
in a yeast 2-hybrid assay. SNX1 is a member of the sorting nexin (SNX) protein 
family, which is implicated in regulating membrane traffic. SNX1 is a membrane 
associated p rotein t hat h as b een s hown t o b e i nvolved w ith t argeting r eceptors t o 
5 lysosomal degradation. SNX1 has been shown to bind to the C-tenninal tail of the 
D5 dopamine receptor (Mol Cell Biol (1998) 18: 7278-87). Accordingly, in certain 
aspects POSH may associate with the D5 dopamine receptor. SNX1 is involved in 
regulating the targeting of internalized epidermal growth factor receptors for 
lysosomal degradation ( Science ( 1996) 2 72:1008-1010). In c ertain aspects, P OSH 
10 may be involved in targeting proteins for degradation to the lysosome. SNX1 has 
also been found to be involved in sorting PARI, a G-protein coupled receptor for 
thrombin (Mol Cell Biol (2002) 13:1965-76). It has further been demonstrated that 
SNX1 functions in regulating trafficking in the endosome compartment via 
recognition of phosphorylated phosphatidylinositol through the phox homology 
1 5 domain (PX domain) of SNX1 (Proc Natl Acad Sci (2002) 99:6767-72). 

The term SNX1 is used herein to refer to various naturally occurring SNX1 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SNX1 (e.g., SEQ 
ID NOs: 281-286, 152-161). The term specifically includes human SNX1 nucleic 
20 acid and amino acid sequences and the sequences presented in Figure 36. 

In additional embodiments, the application relates to the discovery that a 
POSH polypeptide interacts with SNX3. Accordingly, the application provides 
complexes comprising POSH and SNX3. In one aspect, the application relates to 
the discovery that POSH binds directly with SNX3. This interaction was identified 
25 by Applicants in a yeast 2-hybrid assay. SNX3 is also a member of the SNX protein 
family. SNX3 has been shown to associate with the early endosome through its PX 
domain, a domain capable of interaction with phosphatidylinositol-3-phosphate (Nat 
Cell Biol (2002) 3:658-66). Accordingly, POSH may be involved in membrane 
traffic at the early endosome. 
30 The term SNX3 is used herein to refer to various naturally occurring SNX3 

homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SNX3 (e.g., SEQ 
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ID NOS: 287-290, 162-174). The term specifically includes human SNX3 nucleic 
acid and amino acid sequences and the sequences presented in Figure 36. 

In further embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with ATP6V0C. Accordingly, the application provides 
complexes comprising POSH and ASTP6V0C. In one aspect, the application relates 
to the discovery that POSH binds directly with ATP6V0C. This interaction was 
identified by Applicants in a yeast 2-hyhrid assay. ATP6V0C, vacuolar-H(+> 
ATPase, is a large multimeric protein composed of at least twelve distinct subunits 
and it is involved in the H(+) transport across cellular membranes. ATP6V0C is 
synonymous with ATP6L. Treatment with anticancer agents has been shown to 
enhance ATP6L expression (Cytogenet Genome Res (2002) 97:111-5; J Biol Chem 
(2002) 277:36534-43). 

The term ATP6V0C is used herein to refer to various naturally occurring 
ATP6V0C homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
ATP6V0C (e.g., SEQ ID NOs: 225-226, 345-351). The term specifically includes 
human ATP6V0C nucleic acid and amino acid sequences and the sequences 
presented in Figure 36, 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with PPP1CA. Accordingly, the application provides 
complexes comprising POSH and PPP1CA. In one aspect, the application relates to 
the discovery that POSH binds directly with PPP1CA. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. PPP1CA is the protein 
phosphatase type 1 alpha catalytic subunit. The genetic and expression status of the 
PPP1CA gene was examined in 55 human cancer cell lines and found to be 
ubiquitously expressed and lacking in genetic variation, suggesting an essential role 
for PPP1CA in the growth of cancer cells (Int J Oncol (2001) 18:817-24). 

The term PPP1CA is used herein to refer to various naturally occurring 
PPP1CA homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring PPP1CA 
(e;g., SEQ ID NOs: 261-263, 101-110). The term specifically includes human 
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PPP1CA nucleic acid and amino acid sequences and the sequences presented in 
Figure 36. 

The application further relates to the discovery that a POSH polypeptide 
interacts with DDEF1. Accordingly, the application provides complexes comprising 
5 POSH and DDEF1 . In one aspect, the application relates to the discovery that POSH 
binds directly with DDEF1 . This interaction was identified by Applicants in a yeast 
2-hybrid assay. DDEF1 is a putative candidate gene associated with Meckel-Gruber 
syndrome (MKS), the most common monogenic cause of neural tube defects (Hum 
Genet (2002) 111:654-61). 

10 The term DDEF1 is used herein to refer to various naturally occurring 

DDEF1 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring DDEF1 
(e.g., SEQ ID NOs: 233-237, 48-54). The term specifically includes human DDEF1 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

15 In certain embodiments, the application relates to the discovery that a POSH 

polypeptide interacts with PACS-L Accordingly, the application provides 
complexes comprising POSH and PACS-1. In one aspect, the application relates to 
the discovery that POSH binds directly with PACS-1. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. PACS-1 is a cytosolic sorting 

20 protein that directs localization of membrane proteins in the TGN/endosomal 
system. PACS-1 is a cytosolic protein involved in controlling the correct subcellular 
localization of integral membrane proteins that contain acidic cluster sorting motifs, 
such as furin and HIV-1 Nef, and PACS-1 has been shown to interact with the 
adaptor complexes AP-1 and AP-3 (EMBO J (2003) 22:6234-44; EMBO J (2001) 

25 20:2191-201). Furthermore, PACS-1 polypeptides have been shown to interact with 
Nef and through this interaction, by a PI3K-dependent proces, MHC class I 
molecules are downregulated by Nef (Cell (2002) 11:853-66). Accordingly, POSH 
may be involved in Nef-mediated downregulation of MHC class I molecules in a 
cell infected with HIV-1. Additionally, PACS-1 interacts with the HIV-1 protein, 

30 Vpu. Vpu expresses an acidic amino acid sorting motif that is required for TGN 
localization through a retroviral process mediated by PACS-1 (Wan, L et al (1998) 



-47- 



WO 2004/078130 



PCT/US2004/006308 



Cell 94:205-216). Accordingly, in certain aspects, POSH may associate with Vpu 
through its interaction with PACS-1 . 

The term PACS-1 is used herein to refer to various naturally occurring 
PACS-1 homologs, as well as functionally similar variants and fragments that retain 
5 at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring PACS-1 
(e.g., SEQ ID NOs: 362-366, 95-100). The term specifically includes human PACS- 
1 nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with EPS8L2. Accordingly, the application provides 
L0 complexes comprising POSH and EPS8L2. In one aspect, the application relates to 
the discovery that POSH binds directly with EPS8L2. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. EPS8L2 is an eps8-related 
protein. Eps8 forms a multimeric complex with Sos-1, Abil and PI3K that is 
required for Rac activation leading to actin remodelling. EPS8L2 has been shown to 
15 interact with Abil and Sos-1. EPS8L2 also has been shown to localize to PDGF- 
induced F-actin-rich ruffles and to restore receptor tyrosine kinase mediated actin 
remodeling when expressed in eps8-/- fibroblasts (Mol Biol Cell (2004) 15:91-8). 

The term EPS8L2 is used herein to refer to various naturally occurring 
EPS8L2 homologs, as well as functionally similar variants and fragments that retain 
20 at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring EPS8L2 
(e.g., SEQ ID NOs: 239, 58-60). The term specifically includes human EPS8L2 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

The application additionally relates to the discovery that a POSH polypeptide 
interacts with HTP55. Accordingly, the application provides complexes comprising 
25 POSH and H1P55. In one aspect, the application relates to the discovery that POSH 
binds directly with HIP55. This interaction was identified by Applicants in a yeast 
2-hybrid assay. HIP55 is a cytoplasmic adaptor protein that has been shown to bind 
to the cytoplasmic tail of the CD2v protein of African swine fever virus ( J G en 
Virol (2004) 85:119-30). HIP55 (synonymous with mAbpl and SH3P7) comprises 
30 an SH3 domain and through its SH3 domain, associates with dynamin (J Cell Biol 
(2001) 153:351-66; Biochem Biophys Res Commun (2003) 301:704-10). 
Accordingly, in certain aspects, POSH may associate with dynamin through its 
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interaction with HIP55. HBP55 has also been shown to be important for receptor 
mediated e ndocytosis o f t he t ransferrin r eceptor ( Biochem Biophys R es C ommun 
.^(2003)301:704-10). 

The term HDP55 is used herein to refer to various naturally occurring HIPS 5 
5 homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring HEP55 (e.g., SEQ 
ID NOs: 390-394, 377-385). The term specifically includes human HIP55 nucleic 
acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
10 polypeptide interacts with CENTB1. Accordingly, the application provides 
complexes comprising POSH and CENTB1. In one aspect, the application relates to 
the discovery that POSH binds directly with CENTB1. This interaction was 
identified by Applicants in a yeast 2-hybrid assay. CENTB1 is synonymous with 
ACAP1. ACAP1 is an ARF GTPase activating protein (ARF GAP). ACAP1 can 
15 function as a GAP for ARF1 and ARF6 (J Biol Chem (2002) 277:7962-9). 

The term CENTB1 is used herein to refer to various naturally occurring 
CENTB1 homologs, as well as functionally similar variants and fragments that 
retain at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring 
CENTB1 (e.g., SEQ ID NOs: 231-232, 37-47). The term specifically includes 
20 human CENTB1 nucleic acid and amino acid sequences and the sequences presented 
in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with EEF3S3. Accordingly, the application provides 
complexes comprising POSH and EIF3S3. In one aspect, the application relates to 

25 the discovery that POSH binds directly with EIF3S3. This interaction was identified 
by Applicants in a yeast 2-hybrid assay. EIF3S3 is elevated in certain hepatocellular 
carcinomas and in prostate cancer (Hepatology (2003) 38:1242-9; Am J Pathol 
(2001) 159:2081-84). It has also been demonstrated that EIF3S3 is often amplified 
and o verexpressed i n b reast c ancer ( Genes C hromosomes C ancer. (2000) 2 8:203- 

30 210). 

The term EIF3S3 is used herein to refer to various naturally occurring 
EIF3S3 homologs, as well as functionally similar variants and fragments that retain 



-49- 



WO 2004/078130 



PCT/US2004/006308 



at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring EIF3S3 
(e.g., SEQ ID NOs: 238, 55-57). The term specifically includes human EIF3S3 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the discovery that a POSH 
5 polypeptide interacts with SRA1. Accordingly, the application provides complexes 
comprising POSH and SRA1 . In one aspect, the application relates to the discovery 
that POSH binds directly with SRA1. This interaction was identified by Applicants 
in a yeast 2-hybrid assay. SRA1 is a transcriptional coactivator, steroid receptor 
RNA activator 1. SRA is selective for steroid hormone receptors and mediates 

10 transactivation via their amino-terminal activation function (Cell (1999) 97:17-27). 
The term SRA1 is used herein to refer to various naturally occurring SRA1 
homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring SRA1 (e.g., SEQ 
ID NOs: 291-294, 175-182). The term specifically includes human SRA1 nucleic 

15 acid and amino acid sequences and the sequences presented in Figure 36. 

The application additionally relates to the discovery that a POSH polypeptide 
interacts with WASF1. Accordingly, the application provides complexes 
comprising POSH and WASF1. In one aspect, the application relates to the 
discovery that POSH binds directly with WASF1 . This interaction was identified by 

20 Applicants in a yeast 2-hybrid assay. WASF1 is a member of the Wiskott-Aldrich 
syndrome protein (WASP) family of proteins. WASF-1 has been shown to regulate 
cortical actin filament reorganization in response to extracellular stimuli. WASF1 is 
synonymous with WAVE1 and is an actin regulatory protein. It has been shown that 
Ras and the adaptor protein Nek activate actin nucleation through WAVE1 (Nature 

25 (2002)418:790-3). 

The term WASF1 is used herein to refer to various naturally occurring 
WASF1 homologs, as well as functionally similar variants and fragments that retain 
at least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring WASF1 
(e.g., SEQ ID NOs: 389, 375-376). The term specifically includes human WASF1 

30 nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

The application additionally relates to the discovery that a POSH polypeptide 
interacts with SPG20. Accordingly, the application provides complexes comprising 
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POSH and SPG20. In one aspect, the application relates to the discovery that POSH 
binds directly with SPG20. This interaction was identified by Applicants in a yeast 
2-hybrid assay. SPG20 is synonymous with spartin, and mutation in the gene has 
been implicated in Troyer syndrome, an autosomal recessive complicated hereditary 
5 spastic paraplegia. Comparative sequence analysis has shown that spartin shares 
similarity with molecules involved in endosomal trafficking (Nat Genet (2002) 
31:347-8). 

The term SPG20 is used herein to refer to various naturally occurring SPG20 
homologs, as well as functionally similar variants and fragments that retain at least 

10 80%, 90%, 95%, or 99% sequence identity to a naturally occurring SPG20 (e.g., 
SEQ ID NOs: 386-388, 367-374). The term specifically includes human SPG20 
nucleic acid and amino acid sequences and the sequences presented in the Figure 36. 

In further embodiments, the application relates to the discovery that a POSH 
polypeptide interacts with HLA-A. Accordingly, the application provides 

15 complexes comprising POSH and HLA-A. In one aspect, the application relates to 
the discovery that POSH binds directly with HLA-A. This interaction was identified 
by Applicants in a yeast 2-hybrid assay. In additional aspects, the application relates 
to the discovery that a POSH polypeptide interacts with HLA-B. Accordingly, the 
application provides complexes comprising POSH and HLA-B. In one aspect, the 

20 application relates to the discovery that POSH binds directly with HLA-B. This 
interaction was identified by Applicants in a yeast 2-hybrid assay. HLA-A and 
HLA-B are MHC class I molecules. HLA-A and HLA-B molecules are 
downregulated in the progression of AIDS, and this downregulation is associated 
with the activity of HIV-1 Nef. 

25 The term HLA-A is used herein to refer to various naturally occurring HLA- 

A homologs, as well as functionally similar variants and fragments that retain at 
least 8 0%, 9 0%, 9 5%, o r 9 9% se quence i dentity t o a n aturally o ccurring H LA-A 
(e.g., SEQ ID NOs: 253, 87-88). The term specifically includes human HLA-A 
nucleic acid and amino acid sequences and the sequences presented in Figure 36. 

30 The term HLA-B is used herein to refer to various naturally occurring HLA- 

B homologs, as well as functionally similar variants and fragments that retain at 
least 80%, 90%, 95%, or 99% sequence identity to a naturally occurring HLA-B 
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(e.g., SEQ ID NOs: 254, 89). The term specifically includes human HLA-B nucleic 
acid and amino acid sequences and the sequences presented in Figure 36. 

In certain aspects, the application relates to the discovery that a POSH 
polypeptide interacts with a ubiquitin-conjugating enzyme (E2). An exemplary E2 
5 may include, but are not limited to, UBCSa, UBC5c, UBC6, and UBC13. UBC13 is 
often found in a heterodimer complex with a Ub conjugating enzymer variant (UEV) 
protein, such as, for example, UEV la. (See Hofinann and Pickart, Noncanonical 
MMS2-Encoded Ubiquitin-Conjugating Enzyme Functions in Assembly of Novel 
Ubiquitin Chains for DNA Repair, Cell 96: 645-653 (1999), McKenna et aL, 2002, 

10 Energetics and Specificity of Interactions within Ub-Uev-Ubcl3 Human Ubiquitin 
Conjugating Complexs, Biochemistry. Vol. 42. pp.7922-7930, and Ulrich, 2003, 
Protein-Protein Interactions within an E2-RING Finger Complex, The Jurnal of 
Biological Chemistry, Vol. 278. No 9. pp. 7051-7058). UVE proteins share 
significant sequence and structural similarities with E2s, yet lack the requisite active 

15 site cystine of the classical E2 protein family. 

Generally, UBC5 conjugates ubiquitin to Lysine 48 in a target protein, a 
signal that marks the protein for degredation by the 26 S proteosome. In constrast, 
UBC13/UEVla conjugates ubiquitin to Lysine 63 residue in a target protein, which 
is not a degradation signal. Instead, ubiquitin conjugated at Lysine 63 has been 

20 implicated in diverse biological processes, including, for example, DNA damage 
repair, endocytosis, ribosome biogenesis, mitochondrial inheritance, and NFkB 
signaling (See Ulrich, 2003). The UBC13/UEVla has been shown to work with two 
other RJNG-ubiquitin ligases, TRAF6 and RAD5. (See Ulrich, 2003). TRAF6- 
UBC13-UEVla complex ubiquitinates TRAF6 (self-ubiquitination), thus enabling it 

25 to activate a kinase cascade. 

Without being bound to theory, it appears that UBCSa, UBC5c and UBC6 
may work with POSH in one pathway, while UBC13/UEVla work with POSH in 
another distinct pathway. This is supported by the fact that UBC5/6 marks POSH 
for degradation by conjugating ubiquitin at Lysine 48, whereas UBC13/UEVla 

30 marks POSH for purposes other than degradation by conjugating ubiquitin at Lysine 
63. Thistheoryis further supported by the factthatUBC5a,UBC5candUBC6 
share high sequence similarities. 
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Accordingly, in certain aspects, the present application relates to an isolated, 
purified or recombinant complex comprising a POSH polypeptide and a UBC13. In 
certain aspects, the present application relates to an isolated, purified or recombinant 
complex comprising: a polypeptide comprising a domain that is at least 90% 
5 identical to a POSH RING domain, and a POSH-AP comprising an E2. An 
exemplary POSH associated protein E2 include, for example, is UBC13. UBC13 
may beinah eterodimer c omplex w ith a Ub c onjugating enzymer v ariant (UEV) 
protein, such as, for example, UEV la. 

The term "UBC13" and is used herein to refer to full-length UBC13, any 

10 splice variants thereof, various naturally occurring UBC13 homologs, as well as 
functionally similar variants and fragments that retain at least 80%, 90%, 95%, or 
99% sequence identity to a naturally occurring UBC13 (e.g., SEQ ID NOs: 313, 
208-210). The term specifically includes UBC13 nucleic acid and amino acid 
sequences and the sequences presented in Figure 36. 

15 In certain embodiments, the application relates to the interaction between an 

ARF5 polypeptide and a POSH polypeptide. ARF5 is a member of the ARF gene 
family. The ARF proteins stimulate the in vitro ADP-ribosyltransferase activity of 
cholera toxin. ARF proteins play a role in vesicular trafficking in vivo. ARFs are 
members of the Ras GTPase superfamily. ARFs activate specific PLDs. 

20 Mammalian ARFs are divided into three classes based on size, amino acid sequence, 
gene structure, and phylogenetic analysis. ARF1 is in class I, and ARFS is in class 

Li certain embodiments, the application relates to the interaction between an 
ARF1 polypeptide and a POSH polypeptide. ARF1 is a small G protein involved in 

25 vesicular trafficking. The assembly/disassembly cycle of the coat protein I (COPI) 
on Golgi membranes is coupled to the GTP/GDP cycle of ARF1 (Nature (2003) 
426:563-6). ARF1 has been implicated in mitotic Golgi disassembly, chromosome 
segregation, and cytokinesis (Proc Natl Acad Sci (2003) 100:13314-9). ARF1 has 
been shown to bind to the 5-HT2A receptor, a G protein coupled receptor (GPCR) 

30 (Mol Pharmacol (2003) 64:1239-50). 

The term ARF-1 is used herein to refer to various naturally occurring ARF-1 
homologs, as well as functionally similar variants and fragments that retain at least 
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80%, 90%, 95%, or 99% sequence identity to a naturally occurring ARF-1 (e.g. SEQ 
ID NOs: 223, 325-339). The term specifically includes human ARF-1 nucleic acid 
and amino acid sequences and the sequences presented in Figure 36. 

The term ARF-5 is used herein to refer to various naturally occurring ARF-5 
5 homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring ARF-5 (e.g., 
SEQ ID NOs: 224, 340-344). The term specifically includes human ARF-5 nucleic 
acid and amino acid sequences and the sequences presented in Figure 36. 

In certain embodiments, the application relates to the inhibition of viral 

10 maturation by modulation of an activity associated with a dynamin II polypeptide. 
Dynaminll is a large GTP-binding protein that is involved in endocytosis and in 
vesicle formation at the trans-GoIgi network. Dynamin II contains a pleckstrin 
homology domain (PHD) and a proline-rich domain (PRD). Dynamin II plays an 
important role in vesicle formation at the plasma membrane, trans-Golgi network, 

15 and various other intracellular organelles. Accordingly, disrupting the activity of a 
dynamin II polypeptide or the interaction between a POSH polypeptide and a 
dynamin II polypeptide (e.g., by reducing POSH protein levels or alternatively, 
reducing dynamin II protein levels, through RNAi) may disrupt the activity of 
dynamin II in the secretory pathway and prevent the secretion of viral proteins, such 

20 as, for example, HBV proteins. Dynamin II participates in the transport and 
secretion of HBV proteins (Abdulkarim, AS et al (2003) J. Hepat. 38:76-83). 
Accordingly, in certain embodiments, inhibition of POSH adversely effects the 
transport and release of HBV proteins. 

In certain embodiments, the application relates to the inhibition of dynamin 

25 activity, in particular the inhibition of the activity of dynamin II, a member of the 
dynamin family of proteins. In certain embodiments, the application relates to 
inhibition of dynamin II activity, which inhibition disrupts the transport and 
secretion of HBV proteins. The term dynamin II is used herein to refer to full- 
length, human dynamin II as well as various naturally occurring dynamin II 

30 homologs, as well as functionally similar variants and fragments that retain at least 
80%, 90%, 95%, or 99% sequence identity to a naturally occurring dynamin II (e.g., 
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public gi number: 1196422, public gi number: 1706539, public gi number: 1 196423, 
and public gi number: 1363934). 

In certain embodiments, the application relates to the inhibition of viral 
maturation by modulation of an activity associated with a Vpu polypeptide. Vpu is 

5 an HIV-1 encoded ion channel, which, among other tasks in the HIV-1 life cycle, is 
necessary for efficient virus budding (Schubert, U et al ( 1995) J. Virol. 69:7699- 
7711). Vpu may function at the trans Golgi network (TGN). Vpu expresses an 
acidic amino acid sorting motif that is required for TGN localization through a 
retroviralprocessmediatedbythePOSH-AP,PACS-l (Wan, L et al ( 1998) Cell 

10 94:205-216). Moreover, the phenotype conferred by human POSH knockdown is 
similar to that observed in cells expressing HIV-1 lacking Vpu where viruses also 
accumulate in intracellular membranes (Klimkait, T et al (1990) J. Virol. 64:621- 
629). 

Vpu regulates virus release from a post-endoplasmic reticulum compartment, 
1 5 such as possibly the TGN, by an ion channel activity mediated' by its transmembrane 
anchor. Vpu also induces the selective down regulation of host cell receptor 
proteins such as CD4 and major histocompatibility complex class I molecules, in a 
process involving its cytoplasmic tail. Furthermore, Vpu-mediated degradation of 
CD4 is dependent on an intact ubiquitin-conjugating system. (See Schubert, U et al 
20 (1998) J. Virol. 72:2280-8). In certain embodiments of the present invention, Vpu- 
mediated degradation of a protein such as CD4 may involve a ubiquitin-conjugating 
system that includes a POSH polypeptide or a POSH-AP, such as, for example, Cbl- 
b. 

Vpu nucleic acid and the corresponding amino acid sequence encoded 
25 thereby are exemplified by the Vpu discussed in Strebel, K et al (1988) 241:1221- 
1223. The term Vpu is used herein to refer as well to Vpu of other HIV-1 isolates, 
such as the Vpu disclosed in GenBank, accession number U51190, and the Vpu 
disclosed in GenBank, accession number U52953. The term Vpu is used herein to 
refer as well to various naturally occurring Vpu homologs, as well as functionally 
30 similar variants and fragments that retain at least 80%, 90%, 95%, or 99% sequence 
identity to a naturally occurring Vpu. 

thods and Compositions for Treating POSH-associated Diseases 
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In certain aspects, the application provides methods and compositions for 
treatment of POSH-associated diseases (disorders), including cancer and viral 
disorders, as well as disorders associated with unwanted apoptosis,- including, for 
example a variety of neurodegenerative disorders, such as Alzheimer's disease. 
5 In certain embodiments, the application relates to viral disorders (e.g., viral 

infections), and particularly disorders caused by retroid viruses, RNA viruses and/or 
envelope viruses. In view of the teachings herein, one of skill in the art will 
understand that the methods and compositions of the application are applicable to a 
wide range of viruses such as, for example, retroid viruses, RNA viruses, and 
10 envelope viruses. In a preferred embodiment, the present application is applicable to 
retroid viruses. In a more preferred embodiment, the present application is further 
applicable to retroviruses (retroviridae). In another more preferred embodiment, the 
present application is applicable to lentivirus, including primate lentivirus group. In 
a most preferred embodiment, the present application is applicable to Human 
15 Immunodeficiency virus (HIV), Human Immunodeficiency virus type-1 (HTV-1), 
Hepatitis B Virus (HBV) and Human T-cell Leukemia Virus (HTLV). 

While not intended to be limiting, relevant retroviruses include: C-type 
retrovirus which causes lymphosarcoma in Northern Pike, the C-type retrovirus 
which infects mink, the caprine lentivirus which infects sheep, the Equine Infectious 
20 Anemia Virus (EIAV), the C-type retrovirus which infects pigs, the Avian Leukosis 
Sarcoma Virus (ALSV), the Feline Leukemia Virus (FeLV), the Feline Aids Virus, 
the Bovine Leukemia Virus (BLV), Moloney Murine Leukemia Virus (MMuLV), 
the Simian Leukemia Virus (SLV), the Simian hnmuno-deficiency Virus (SIV), the 
Human T-cell Leukemia Virus type-I (HTLV-I), the Human T-cell Leukemia Virus 
25 type-n (HTLV-H), Human Immunodeficiency virus' type-2 (HTV-2) and Human 
Immunodeficiency virus type-1 (HTV-1). 

The method and compositions of the present application are further 
applicable to RNA viruses, including ssRNA negative-strand viruses and ssRNA 
positive-strand viruses. The ssRNA positive-strand viruses include Hepatitis C 
30 Virus (HCV). In a preferred embodiment, the present application is applicable to 
mononegavirales, including filoviruses. Filoviruses farther include Ebola viruses 
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and Marburg viruses. In another preferred embodiment, the present invention is 
applicable to flavi viruses, including West Nile Virus (WNV). 

Other RNA viruses include picornaviruses such as enterovirus, poliovirus, 
coxsackievirus and hepatitis A virus, the caliciviruses, including Norwalk-like 
5 viruses, the rhabdoviruses, including rabies virus, the togaviruses including 

alphaviruses, Semliki Forest virus, denguevirus, yellow fever virus and rubella virus, 
the orthomyxoviruses, including Type A, B, and C influenza viruses, the 
bunyaviruses, including the Rift Valley fever virus and the hantavirus, the 
filoviruses such as Ebola virus and Marburg virus, and the paramyxoviruses, 
10 including mumps virus and measles virus. Additional viruses that may be treated 
. include herpes viruses. 

The methods and compositions of the present application are further 
applicable to hepatotrophic viruses, including HAV, HBV, HCV, HDV, and HEV. 
In certain aspects, the application relates to a method of inhibiting a hepatotrophic 
15 virus, comprising administering a POSH inhibitor to a subject in need thereof. In 
further aspects, the application relates to a method of treating a viral hepatitis 
infection, comprising administering a POSH inhibitor to a subject in need thereof. 
A viral hepatitis infection may be caused by a hepatotrophic virus, such as HAV, 
HBV, HCV, HDV, or HEV. In certain embodiments, the application relates to a 
20 method of treating an HBV infection by administering a POSH inhibitor to a subject 
in need thereof. 

In other embodiments, the application relates to methods of treating or 
preventing cancer diseases. The terms "cancer/' "tumor," and "neoplasia" are used 
interchangeably herein. As used herein, a cancer (tumor or neoplasia) is 

25 characterized by one or more of the following properties: cell growth is not 

regulated by the normal biochemical and physical influences in the environment; 
anaplasia (e.g., lack of normal coordinated cell differentiation); and in some 
instances, metastasis. Cancer diseases include, for example, anal carcinoma, bladder 
carcinoma, breast carcinoma, cervix carcinoma, chronic lymphocytic leukemia, 

30 chronic myelogenous leukemia, endometrial carcinoma, hairy cell leukemia, head 
and neck carcinoma, lung (small cell) carcinoma, multiple myeloma, non-Hodgkin's 
lymphoma, follicular lymphoma, ovarian carcinoma, brain tumors, colorectal 
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carcinoma, hepatocellular carcinoma, Kaposi's sarcoma, lung (non-small cell 
carcinoma), melanoma, pancreatic carcinoma, prostate carcinoma, renal cell 
carcinoma, and soft tissue sarcoma. Additional cancer disorders can be found in, for 
example, Isselbacher et al. (1994) Harrison's Principles of Internal Medicine 1814- 

5 1877, herein incorporated by reference. 

In a specific embodiment, anticancer therapeutics of the application are used 
in treating a POSH-associated cancer. As described herein, POSH-associated 
cancers include, but are not limited to, the thyroid carcinoma, liver cancer 
(hepatocellular cancer), lung cancer, cervical cancer, ovarian cancer, renal cell 

10 carcinoma, lymphoma, osteosacoma, liposarcoma, leukemia, breast carcinoma, and 
breast adeno-carcinoma. 

Preferred antiviral and anticancer therapeutics of the application can function 
by disrupting the biological activity of a POSH polypeptide or POSH complex in 
viral maturation. Certain therapeutics of the application function by disrupting the 

1 5 activity of a POSH-AP (e.g., HERPUD1) in viral maturation. Certain therapeutics 
of the application function by disrupting the activity of POSH by inhibiting the 
ubiquitin ligase activity of a POSH polypeptide. In certain embodiments of the 
application, a therapeutic of the application inhibits the ubiquitination of a POSH- 
AP, such as for example the ubiquitination of HERPUD1. 

20 In other embodiments, the application relates to methods of treating or 

preventing n etiological disorders. In o ne a spect, t he i nvention p rovides m ethods 
and compositions for the identification of compositions that interfere with the 
function of a POSH or a POSH-AP, which function may relate to aberrant protein 
processing associated with a neurodegenerative disorder, such as for example, the 

25 processing of amyloid beta precursor protein associated with Alzheimer's disease. 
Neurological disorders include disorders associated with increased levels of amyloid 
P production, such as for example, Alzheimer's disease. Neurological disorders also 
include Parkinson's disease, Huntington's disease, schizophrenia, Niemann-Pick's 
disease, and prion-associated diseases 

30 Exemplary therapeutics of the application include nucleic acid therapies such 

as, for example, RNAi constructs (small inhibitory RNAs), antisense 
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oligonucleotides, ribozyme, and DNA enzymes. Other therapeutics include 
polypeptides, peptidomimetics, antibodies and small molecules. 

Antisense therapies of the application include methods of introducing 
antisense nucleic acids to disrupt the expression of POSH polypeptides or proteins 
5 that are necessary for POSH function. 

RNAi therapies include methods of introducing RNAi constructs to 
downregulate the expression of POSH polypeptides or POSH-APs (e.g., 
HERPUD1). In certain embodiments, RNAi therapeutics are delivered to the liver 
(e.g., to hepatocytes). Exemplary RNAi therapeutics include any one of SEQ ID 
10 NOs: 15, 16, 18, 19, 21, 22, 24 and 25. 

Therapeutic polypeptides may be generated by designing polypeptides to 
mimic certain protein domains important in the formation of POSH: POSH-AP 
complexes, such as, for example, SH3 or RING domains. For example, a 
polypeptide comprising a POSH SH3 domain such as, for example, the SH3 domain 
15 as set forth in SEQ ID NO: 30 will compete for binding to a POSH SH3 domain and 
will therefore act to disrupt binding of a partner protein. In one embodiment, a 
binding partner may be a Gag polypeptide. In another embodiment, a binding 
partner may be Rac. In a further embodiment, a polypeptide that resembles an L 
domain may disrupt recruitment of Gag to the POSH complex. 
20 In view of the specification, methods for generating antibodies directed to 

epitopes of POSH and POSH-APs are known in the art. Antibodies may be 
introduced into cells by a variety of methods. One exemplary method comprises 
generating a nucleic acid encoding a single chain antibody that is capable of 
disrupting a POSH:POSH-AP complex. Such a nucleic acid may be conjugated to 
25 antibody that binds to receptors on the surface of target cells. It is contemplated that 
in certain embodiments, the antibody may target viral proteins that are present on the 
surface of infected cells, and in this way deliver the nucleic acid only to infected 
cells. Once bound to the target cell surface, the antibody is taken up by endocytosis, 
and the conjugated nucleic acid is transcribed and translated to produce a single 
30 chain antibody that interacts with and disrupts the targeted POSH:POSH-AP 

complex. Nucleic acids expressing the desired single chain antibody may also be 
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introduced into cells using a variety of more conventional techniques, such as viral 
transfection (e.g., using an adenoviral system) or liposome-mediated transfection. 

Small molecules of the application may be identified for their ability to 
modulate the formation of POSH.POSH-AP complexes. 

Certain embodiments of the disclosure relate to use of a small molecule as an 
inhibitor of POSH. Examples of such small molecules include the following 

compounds: 

Compound CAS 27430-18-8: 




O 



10 



Compound CAS 1631-29-4: 
O 




Compound CAS 503065-65-4: 



15 




Compound CAS 414908-08: 
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Compound CAS 415703-60-5: 
Br 




Br 



5 Compound CAS 77367-94-3: 




CI 



Compound CAS 154184-27-7: 

10 

In certain embodiments, compounds useful in the instant compositions and 
methods include heteroarylmethylene-dihydro-2,4,6-pyrimidinetriones and their 
thione analogs. Preferred heteroaryl moieties include 5-membered rings such as 
thienyl, furyl, pyrrolyl, oxazolyl, thiazolyl, and imidazolyl moieties. 
15 In certain embodiments, compounds useful in the instant compositions and 

methods include N-aiylmaleimides, especially N-phenylmaleimides, in which the 
phenyl group may be substituted or unsubstituted. 

In certain embodiments, compounds useful in the instant compositions and 
methods include arylallylidene-2,4-imidazolidinediones and their thione analogs. 
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Preferred aryl groups are phenyl groups, and both the aryl and allylidene portions of 
the molecule may be substituted or unsubstituted. 

In certain embodiments, compounds useful in the instant compositions and 
methods include substituted distyryl compounds and aza analogs thereof such as 
5 substituted 1,4-diphenylazabutadiene compounds. 

In certain other embodiments, compounds useful in the instant compositions 
and methods include substituted styrenes and aza analogs thereof, such as 1,2- 
diphenylazaethylenes and l-phenyl-2-pyridyl-azaethelenes. 

In yet other embodiments, compounds useful in the instant compositions and 
10 methods include N-aryl-N'-acylpiperazines. In such compounds, the aryl ring, the 
acyl substituent, and/or the piperazine ring may be substituted or unsubstituted. 

In additional embodiments, compounds useful in the instant compositions 
and methods include aryl esters of (2-oxo-benzooxazol-3-yl)-acetic acid, and 
analogs thereof in which one or more oxygen atoms are replaced by sulfur atoms. 
15 In certain embodiments, the present application contemplates use of known 

PKA modulators (e.g., inhibitors or activators) in the methods of ihibiting viral 
infection and in the methods of treating or preventing cancer. Such PKA modulators 
include any compound, peptide, nucleotide derivative, nucleoside derivative, 
polysaccharide, sugar or other substance that can inhibit the activity of protein 
20 kinase A. Many PKA inhibitors are available and may be used. For example, many 
examples of PKA inhibitors including chemical structures, methods for 
administration and pharmacological effects are listed at the Calbiochem website at 
calbiochem.com. In general, inhibitors that also significantly inhibit protein kinase C 
activity are avoided. 

25 In some embodiments, the PKA inhibitor is a nucleotide or nucleoside 

derivative. Specific examples of nucleoside or nucleotide derivatives that act as 
PKA inhibitors and that can be utilized in the disclosure include adenosine 3',5 f 
cyclic monophosphorothioate. The H-89 inhibitor is a potent PKA inhibitor that can 
be used in the disclosure. The chemical name for the H-89 inhibitor is N-[2- 

30 ((Pbromocinnamyl) amino)ethyl] isoquinolinesulfonamide. The KT5720 inhibitor 
from Calbiochem can also be used in the disclosure. Other PKA inhibitors which are 
available at from Calbiochem and can be used in the disclosure include ellagic acid 
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(also named 4,4 , ,5,5 , ,6,6 , -hexahydroxydiphenic acid 2,6,2\6 f -ditactone), piceatannol, 
1 -(5-Isoquinolinesulfonyl) methylpiperazine (H-7), N-[2-(methylamino)ethyl] 
isoquinolinesulfonamide (H-8), N-(2-aminoethyl) isoquinolinesulfonamide (H-9), 
and (5-isoquinolinesulfonyl)piperazine, 2HCI (H-100). 

The PKA inhibitor can also be a peptide inhibitor (PKI). Such a peptide 
inhibitor can be any peptide that is recognized andboundby PKAbut thatPKA 
cannot phosphorylate. An example of a peptide inhibitor is a peptide with a 
"consensus sequence" for PKA recognition but with alanine in place of serine, for 
example, a peptide with the following sequence: Xaa-Arg-Arg-Xaa-Ala-Xaa, 
wherein Xaa is any amino acid, which specifically binds to the pseudoregion of the 
regulatory domain of PKA. Myristoylated PKA inhibitor amide (14-22, Cell- 
Permeable) having the sequence Myr-N-Gly-Arg-Thr-Gly-Arg-Arg-Asn-Ala-Ile- 
NH2 is another example of a peptide inhibitor that can be utilized in the disclosure. 
A variety of other PKI peptides can be used as an inhibitor of protein kinase A in the 
practice of the disclosure. For example, several PKI peptides can be found in the 
NCBI protein database. See website at ncbi.nlm.nih.gov/ 
Genbank/GenbankOverview. One example of a human PKI peptide can be found at 
Genbank Accession No. P04541 (gi: 417194). Another example of a human PKI 
peptide is at Genbank Accession No. NP 008997 (gi: 5902020). Another PKI that 
can be used as an inhibitor has the following sequence: 
Ile-Ala-Ser-Gly-Arg-Thr-Gly-Arg-Arg-Asn-Ala-Ile-His-Asp-1 1 e-Leu-Val-SerSer- 
Ala. See published PCT application WO 03/080649. 

Further examples of protein kinase A inhibitors are provided in the following 
references: Muniz et al., Proceedings of the National Academy of Sciences USA 
1997 Dec 23; 94(26) 14461-66; Baude et al., Journal of Biological Chemistry Vol. 
269 issue 27 18128-18133 (Jul. 1994); Scott et al. 

Applicants found that POSH is phosphorylated by PKA and phosphorylation 
of POSH by PKA can inhibit POSH function, for example dissociating POSH from 
POSH interacting proteins (e.g, Rac). Therefore, in certain embodiments, the present 
disclosure also cotemplates use of PKA activators in treating or preventing a POSH- 
associated disease (e.g., viral infection or cancer). Exemplary PKA activators 
include, but are not limited to, forskolin, 8-Br-cAMP, and rolipram. 
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In additional embodiments of the application, compounds useful in the 
present application include phosphatase inhibitors. Phosphatase inhibitors useful in 
the subject application include sodium phosphate, sodium vanadate, and okadaic 
acid. In certain embodiments, the present application contemplates use of known 
5 phosphatase inhibitors in the methods of inhibiting viral infection, in the methods of 
treating or preventing cancer, and in the methods of inhibiting the progression of a 
neurodenerative disorder. Phosphatase inhibitors may be useful in inhibiting the 
activity of a POSH-AP, such as for example, PTPN12. 

For POSH-APs that are GTPases, inhibitors such as GTPgamma35S would 
10 be effective at inhibiting the GTPase activity of the POSH-AP. For example, 
inhibition of ARF1 or ARF5 could be accomplished with the use of a GTPase 
inhibitor such as GTPgamma35S, a non-hydrolyzable form of GTP. 

The generation o f nucleic acid b ased t herapeutic agents directed to P OSH 
and POSH-APs is described below. 
15 Methods for identifying and evaluating further modulators of POSH and 

POSH-APs are also provided below. 

5. RNA Interference. Ribozvmes, Antisense and Related Constructs 

In certain aspects, the application relates to RNAi, ribozyme, antisense and 
20 other nucleic acid-related methods and compositions for manipulating (typically 
decreasing) a POSH activity. Exemplary RNAi and ribozyme molecules may 
comprise a sequence as shown in any of SEQ ID Nos: 15, 16, 18, 19, 21, 22, 24 and 
25. 

In certain aspects, the application relates to RNAi, ribozyme, antisense and 
25 other nucleic acid-related methods and compositions for manipulating (typically 
decreasing) a POSH-AP activity. Specific instances of nucleic acids that may be 
used to design nucleic acids for RNAi, ribozyme, antisense are provided in Figure 
36. Additionally, nucleic acids of POSH-APs listed in Table 8 may be used to 
design nucleic acids for RNAi, ribozyme, antisense. 
30 Certain embodiments of the application make use of materials and methods 

for e ffecting k nockdown ofoneorm ore P OSH or P OSH-AP genes b y means o f 
RNA interference (RNAi). RNAi is a process of sequence-specific post- 
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transcriptional gene repression which can occur in eukaryotic cells. In general, this 
process involves degradation of an mRNA of a particular sequence induced by 
double-stranded RNA (dsRNA) that is homologous to that sequence. For example, 
the expression of a long dsRNA corresponding to the sequence of a particular single- 
5 stranded mRNA (ss mRNA) will labilize that message, thereby "interfering" with 
expression of the corresponding gene. Accordingly, any selected gene may be 
repressed by introducing a dsRNA which corresponds to all or a substantial part of 
the m RNA f or t hat gene. It appears t hat w hen a 1 ong d sRNA i s expressed, i t i s 
initially processed by a ribonuclease HI into shorter dsRNA oligonucleotides of as 

10 few as 21 to 22 base pairs in length. Furthermore, Accordingly, RNAi may be 
effected by introduction or expression of relatively short homologous dsRNAs. 
Indeed the use of relatively short homologous dsRNAs may have certain advantages 
as discussed below. 

Mammalian cells have at least two pathways that are affected by double- 

15 stranded RNA (dsRNA). In the RNAi (sequence-specific) pathway, the initiating 
dsRNA i s first b roken i nto s hort i nterfering ( si) R NAs, a s d escribed a bove. T he 
siRNAs have sense and antisense strands of about 21 nucleotides that form 
approximately 19 nucleotide si RNAs with overhangs of two nucleotides at each 3' 
end. Short interfering RNAs are thought to provide the sequence information that 

20 allows a specific messenger RNA to be targeted for degradation. In contrast, the 
nonspecific pathway is triggered by dsRNA of any sequence, as long as it is at least 
about 30 base pairs in length. The nonspecific effects occur because dsRNA 
activates two enzymes: PKR, which in its active form phosphorylates the translation 
initiation factor eEF2 to shut down all protein synthesis, and 2\ 5 y oligoadenylate 

25 synthetase (2% 5'-AS), which synthesizes a molecule that activates Rnase L, a 
nonspecific enzyme that targets all mRNAs. The nonspecific pathway may 
represent a host response to stress or viral infection, and, in general, the effects of 
the nonspecific pathway are preferably minimized under preferred methods of the 
present application. Significantly, longer dsRNAs appear to be required to induce 

30 the nonspecific pathway and, accordingly, dsRNAs shorter than about 30 bases pairs 
are preferred to effect gene repression by RNAi (see Hunter et al. (1975) J Biol 
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Chem 250: 409-17; Manche et al. (1992) Mol Cell Biol 12: 5239-48; Minks et al. 
(1979) J Biol Chem 254: 10180-3; and Elbashir et al. (2001) Nature 411: 494-8). 

RNAi has been shown to be effective in reducing or eliminating the 
expression of genes in a number of different organisms including Caenorhabditiis 
5 elegans (see e.g., Fire et al. (1998) Nature 391: 806-11), mouse eggs and embryos 
(Wianny et al. (2000) Nature Cell Biol 2: 70-5; Svoboda et al. (2000) Development 
127: 4147-56), and cultured RAT-1 fibroblasts (Bahramina et al. (1999) Mol Cell 
Biol 19: 274-83), and appears to be an anciently evolved pathway available in 
eukaryotic plants and animals (Sharp (2001) Genes Dev. 15: 485-90). RNAi has 

10 proven to be an effective means of decreasing gene expression in a variety of cell 
types including HeLa cells, NIH/3T3 cells, COS cells, 293 cells and BHK-21 cells, 
and typically decreases expression of a gene to lower levels than that achieved using 
antisense techniques and, indeed, frequently eliminates expression entirely (see Bass 
(2001) Nature 411: 428-9). In mammalian cells, siRNAs are effective at 

15 concentrations that are several orders of magnitude below the concentrations 
typically used in antisense experiments (Elbashir et al. (2001) Nature 41 1: 494-8). 

The double stranded oligonucleotides used to effect RNAi are preferably less 
than 30 base pairs in length and, more preferably, comprise about 25, 24, 23, 22, 21, 
20, 19, 18 or 17 base pairs of ribonucleic acid. Optionally the dsRNA 

20 oligonucleotides o f t he application m ay i nclude 3 ' o verhang ends. E xemplary 2 - 
nucleotide 3' overhangs maybe composed of ribonucleotide residues of any type 
and may even be composed of 2 , -deoxythymidine resides, which lowers the cost of 
RNA synthesis and may enhance nuclease resistance of siRNAs in the cell culture 
medium and within transfected cells (see Elbashir et al. (2001) Nature 41 1 : 494-8). 

25 Longer dsRNAs of 50, 75, 100 or even 500 base pairs or more may also be utilized 
in certain embodiments of the application. Exemplary concentrations of dsRNAs for 
effecting RNAi are about 0.05 nM, 0.1 nM, 0.5 nM, 1.0 nM, 1.5 nM, 25 nM or 100 
nM, although other concentrations may be utilized depending upon the nature of the 
cells treated, the gene target and other factors readily discernable the skilled artisan. 

30 Exemplary dsRNAs may be synthesized chemically or produced in vitro or in vivo 
using appropriate expression vectors. Exemplary synthetic RNAs include 21 
nucleotide RNAs chemically synthesized using methods known in the art (e.g., 
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Expedite RNA phophoramidites and thymidine phosphoramidite (Proligo, 
Germany). S ynthetic o ligonucleotides are preferably d eprotected and gel-purified 
using methods known in the art (see e.g., Elbashir et al. (2001) Genes Dev. 15: 188- 
200). Longer RNAs may be transcribed from promoters, such as T7 RNA 
5 polymerase promoters, known in the art. A single RNA target, placed in both 
possible orientations downstream of an in vitro promoter, will transcribe both 
strands of the target to create a dsRNA oligonucleotide of the desired target 
sequence. Any of the above RNA species will be designed to include a portion of 
nucleic acid sequence represented in a POSH or POSH-AP nucleic acid, such as, for 

10 example, a nucleic acid that hybridizes, under stringent and/or physiological 
conditions, to any of SEQ ID Nos: 1, 3, 4, 6, 8 and 10 and complements thereof or 
any of the POSH-AP sequences presented in Figure 36. 

The specific sequence utilized in design of the oligonucleotides may be any 
contiguous sequence of nucleotides contained within the expressed gene message of 

15 the target. Programs and algorithms, known in the art, may be used to select 
appropriate target sequences. In addition, optimal sequences may be selected 
utilizing programs designed to predict the secondary structure of a specified single 
stranded nucleic acid sequence and allowing selection of those sequences likely to 
occur in exposed single stranded regions of a folded mRNA. Methods and 

20 compositions for designing appropriate oligonucleotides may be found, for example, 
in U.S. Patent Nos. 6,251,588, the contents of which are incorporated herein by 
reference. Messenger RNA (mRNA) is generally thought of as a linear molecule 
which contains the information for directing protein synthesis within the sequence of 
ribonucleotides, however studies have revealed a number of secondary and tertiary 

25 structures that exist in most mRNAs. Secondary structure elements in RNA are 
formed largely by Watson-Crick type interactions between different regions of the 
same RNA molecule. Important secondary structural elements include 
intramolecular double stranded regions, hairpin 1 oops, bulges in duplex RNA and 
internal loops. Tertiary structural elements are formed when secondary structural 

30 elements come in contact with each other or with single stranded regions to produce 
a more complex three dimensional structure. A number of researchers have 
measured the binding energies of a large number of RNA duplex structures and have 
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derived a set of rules which can be used to predict the secondary structure of RNA 
(see e.g., Jaeger et al. (1989) Proc. Natl. Acad. Sci. USA 86:7706 (1989); and 
Turner et al. (1988) Annu. Rev. Biophys. Biophys. Chem. 17:167) . The rules are 
useful in identification of RNA structural elements and, in particular, for identifying 
5 single stranded RNA regions which may represent preferred segments of the mRNA 
to target for silencing RNAi, ribozyme or antisense technologies. Accordingly, 
preferred segments of the mRNA target can be identified for design of the RNAi 
mediating dsRNA oligonucleotides as well as for design of appropriate ribozyme 
and hammerheadribozyme compositions of the application. 

10 The dsRNA oligonucleotides may be introduced into the cell by transfection 

with an heterologous target gene using carrier compositions such as liposomes, 
which are known in the art- e.g., Lipofectamine 2000 (Life Technologies) as 
described by the manufacturer for adherent cell lines. Transfection of dsRNA 
oligonucleotides for targeting endogenous genes may be carried out using 

15 Oligofectamine (Life Technologies). Transfection efficiency may be checked using 
fluorescence microscopy for mammalian cell lines after co-transfection of hGFP- 
encoding pAD3 (Kehlenback et al. (1998) J Cell Biol 141: 863-74). The 
effectiveness of the RNAi may be assessed by any of a number of assays following 
introduction of the dsRNAs. These include Western blot analysis using antibodies 

20 which recognize the POSH or POSH-AP gene product following sufficient time for 
turnover o f t he e ndogenous p ool a fter n ew p rotein s ynthesis i s r epressed, r everse 
transcriptase polymerase chain reaction and Northern blot analysis to determine the 
level of existing POSH or POSH-AP target mRNA. 

Further compositions, methods and applications of RNAi technology are 

25 provided in U.S. Patent Application Nos. 6,278,039, 5,723,750 and 5,244,805, 
which are incorporated herein by reference. 

Ribozyme molecules designed to catalytically cleave POSH or POSH-AP 
mRNA transcripts can also be used to prevent translation of suject POSH or POSH- 
AP mRNAs and/or expression of POSH or POSH-APs (see, e.g., PCT International 

30 Publication WO90/11364, published October 4, 1990; Sarver et al. (1990) Science 
247:1222-1225 and U.S. Patent No. 5,093,246). Ribozyrnes are enzymatic RNA 
molecules capable of catalyzing the specific cleavage of RNA. (For a review, see 
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Rossi (1994) Current Biology 4: 469-471). The mechanism of ribozyme action 
involves sequence specific hybridization of the ribozyme molecule to 
complementary target RNA, followed by an endonucleolytic cleavage event. The 
composition of ribozyme molecules preferably includes one or more sequences 
5 complementary to a POSH or POSH-AP mRNA, and the well known catalytic 
sequence responsible for mRNA cleavage or a functionally equivalent sequence 
(see, e.g., U.S. Pat. No. 5,093,246, which is incorporated herein by reference in its 
entirety). 

While ribozymes that cleave mRNA at site specific recognition sequences 

10 can be used to destroy target mRNAs, the use of hammerhead ribozymes is 
preferred. Hammerhead ribozymes cleave mRNAs at locations dictated by flanking 
regions that form complementary base pairs with the target mRNA. Preferably, the 
target mRNA has the following sequence of two bases: S'-UG-S'. The construction 
and production of hammerhead ribozymes is well known in the art and is described 

15 more fully in Haseloff and Gerlach ((1988) Nature 334:585-591; and see PCT 
Appln. No. WO89/05852, the contents of which are incorporated herein by 
reference). H ammerhead ribozyme s equences c an b e embedded in a stable RNA 
such as a transfer RNA (tRNA) to increase cleavage efficiency in vivo (Perriman et 
al. (1995) Proc. Natl. Acad. Sci. USA, 92: 6175-79; de Feyter, and Gaudron, 

20 Methods in Molecular Biology, Vol. 74, Chapter 43, "Expressing Ribozymes in 
Plants", E dited b y T urner, P . C , H umana P ress Inc., T otowa, N .J). In p articular, 
RNA polymerase Hi-mediated expression of tRNA fusion ribozymes are well 
known in the art ( see Kawasaki et al. (1998) Nature- 393: 284-9; Kuwabara et al. 
(1998) Nature Biotechnol. 16: 961-5; and Kuwabara et al. (1998) Mol. Cell 2: 617- 

25 27; Koseki et al. (1999) I Virol 73: 1868-77; Kuwabara et al. (1999) Proc Natl Acad 
Sci USA 96: 1886-91; Tanabe et al. (2000) Nature 406: 473-4). There are typically 
a number of potential hammerhead ribozyme cleavage sites within a given target 
cDNA sequence. Preferably the ribozyme is engineered so that the cleavage 
recognition site is located near the 5' end of the target mRNA- to increase efficiency 

30 and minimize the intracellular accumulation of non-functional mRNA transcripts. 
Furthermore, the use of any cleavage recognition site located in the target sequence 
encoding different portions of the C-terminal amino acid domains of, for example, 
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long and short forms of target would allow the selective targeting of one or the other 
form of the target, and thus, have a selective effect on one form of the target gene 
product. 

Gene targeting ribozymes necessarily contain a hybridizing region 
5 complementary to two regions, each of at least 5 and preferably each 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 contiguous nucleotides in length of a POSH 
or POSH-AP mPvNA, such as an mRNA of a sequence represented in any of SEQ ID 
Nos: 1, 3, 4, 6, 8 or 10 or a POSH-AP presented in Figure 36. In addition, 
ribozymes possess highly specific endoribonuclease activity, which autocatalytically 
10 cleaves the target sense mRNA. The present application extends to ribozymes 
which hybridize to a sense mRNA encoding a POSH gene such as a therapeutic drug 
target candidate gene, thereby hybridising to the sense mRNA and cleaving it, such 
that it is no longer capable of being translated to synthesize a functional polypeptide 
product. 

15 The ribozymes of the present application also include RNA 

endoribonucleases (hereinafter "Cech-type ribozymes") such as the one which 
occurs naturally in Tetrahymena thermophila (known as the TVS, or L-19 IVS RNA) 
and which has been extensively described by Thomas Cech and collaborators (Zaug, 
et al. (1984) Science 224:574-578; Zaug, et al. (1986) Science 231:470-475; Zaug, 

20 et al. (1986) Nature 324:429-433; published International patent application No. 
WO88/04300 by University Patents Inc.; Been, et al. (1986) Cell 47:207-216). The 
Cech-type ribozymes have an eight base pair active site which hybridizes to a target 
RNA sequence whereafter cleavage of the target RNA takes place. The application 
encompasses those Cech-type ribozymes which target eight base-pair active site 

25 sequences that are present in a target gene or nucleic acid sequence. 

Ribozymes can be composed of modified oligonucleotides (e.g., for 
improved stability, targeting, etc.) and should be delivered to cells which express the 
target gene in vivo. A preferred method of delivery involves using a DNA construct 
"encoding" the ribozyme under the control of a strong constitutive pol III or pol II 
30 promoter, so that transfected cells will produce sufficient quantities of the ribozyme 
to destroy endogenous target messages and inhibit translation. Because ribozymes, 
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unlike antisense molecules, are catalytic, a lower intracellular concentration is 
required for efficiency. 

In certain embodiments, a ribozyme may be designed by first identifying a 
sequence portion sufficient to cause effective knockdown by RNAi. The same 
5 sequence portion may then be incorporated into a ribozyme. In this aspect of the 
application, the gene-targeting portions of the ribozyme or RNAi are substantially 
the same sequence of at least 5 and preferably 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
17, 18, 19 or 20 or more contiguous nucleotides of a POSH nucleic acid, such as a 
nucleic acid of any of SEQ ID Nos: 1, 3, 4, 6, 8, or 10 or POSH-AP nucleic acid, as 
10 presented in Figure 36. In a long target RNA chain, significant numbers of target 
sites are not accessible to the ribozyme because they are hidden within secondary or 
tertiary structures (Birikh et al. (1997) Eur J Biochem 245: 1-16). To overcome the 
problem of target RNA accessibility, computer generated predictions of secondary 
structure are typically used to identify targets that are most likely to be single- 
1 5 stranded or have an "open" configuration (see Jaeger et al. (1989) Methods Enzymol 
183: 281-306). Other approaches utilize a systematic approach to predicting 
secondary structure which involves assessing a huge number of candidate 
hybridizing oligonucleotides molecules (seeMilner et al. (1997) Nat Biotechnol 15: 
537-41; and Patzel and Sczakiel (1998) Nat Biotechnol 16: 64-8). Additionally, U.S. 
20 Patent No. 6,251,588, the contents of which are hereby incorporated herein, 
describes methods for evaluating oligonucleotide probe sequences so as to predict 
the potential for hybridization to a target nucleic acid sequence. The method of the 
application provides for the use of such methods to select preferred segments of a 
target mRNA sequence that are predicted to be single-stranded and, further, for the 
25 opportunistic utilization of the same or substantially identical target mRNA 
sequence, preferably comprising about 10-20 consecutive nucleotides of the target 
mRNA, in the design of both the RNAi oligonucleotides and ribozymes of the 
application. 

A further aspect of the application relates to the use of the isolated 
30 "antisense" nucleic acids to inhibit expression, e.g., by inhibiting transcription 
and/or translation of a POSH or POSH-AP nucleic acid. The antisense nucleic acids 
mav bind to the potential drug target by conventional base pair complementarity, or, 
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for example, in the case of binding to DNA duplexes, through specific interactions 
in the major groove of the double helix. In general, these methods refer to the range 
of techniques generally employed in the art, and include any methods that rely on 
specific binding to oligonucleotide sequences. 
5 An antisense construct of the present application can be delivered, for 

example, as an expression plasmid which, when transcribed in the cell, produces 
RNA w hich i s complementary t o at 1 east a u nique p ortion o f t he cellular m RNA 
which encodes a POSH or POSH-AP polypeptide. Alternatively, the antisense 
construct is an oligonucleotide probe, which is generated ex vivo and which, when 

10 introduced into the cell causes inhibition of expression by hybridizing with the 
mRNA and/or genomic sequences of a POSH or POSH-AP nucleic acid. Such 
oligonucleotide probes are preferably modified oligonucleotides, which are resistant 
to endogenous nucleases, e.g., exonucleases and/or endonucleases, and are therefore 
stable in vivo. Exemplary nucleic acid molecules for use as antisense 

15 oligonucleotides are phosphoramidate, phosphothioate and methylphosphonate 
analogs of DNA (see also U.S. Patents 5,176,996; 5,264,564; and 5,256,775). 
Additionally, general approaches to constructing oligomers useful in antisense 
therapy have been reviewed, for example, by Van der Krol et al. (1988) 
BioTechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659- 2668. 

20 With respect to antisense DNA, oligodeoxyribonucleotides derived from the 

translation initiation site, e.g., between the -10 and +10 regions of the target gene, 
are preferred. Antisense approaches involve the design of oligonucleotides (either 
DNA or RNA) that are complementary to mRNA encoding a POSH or POSH-AP 
polypeptide. The antisense oligonucleotides will bind to the mRNA transcripts and 

25 prevent translation. Absolute complementarity, although preferred, is not required. 
In the case of double-stranded antisense nucleic acids, a single strand of the duplex 
DNA may thus be tested, or triplex formation may be assayed. The ability to 
hybridize will depend on both the degree of complementarity and the length of the 
antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the more 

30 base mismatches with an RNA it may contain and still form a stable duplex (or 
triplex, as the case may be). One skilled in the art can ascertain a tolerable degree of 
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mismatch by use of standard procedures to determine the melting point of the 

hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the mRNA, e.g., the 
5' untranslated sequence up to and including the AUG initiation codon, should work 
5 most efficiently at inhibiting translation. However, sequences complementary to the 
3' u ntranslated s equences o f m RNAs h ave r ecently b een s hown t o b e e ffective a t 
inhibiting translation of mRNAs as well. (Wagner, R. 1994. Nature 372:333). 
Therefore, oligonucleotides complementary to either the 5' or 3* untranslated, non- 
coding regions of a gene could be used in an antisense approach to inhibit translation 
10 of that mRNA. Oligonucleotides complementary to the 5' untranslated region of the 
mRNA should include the complement of the AUG start codon. Antisense 
oligonucleotides complementary to mRNA coding regions are less efficient 
[ inhibitors of translation but could also be used in accordance with the application. 
Whether designed to hybridize to the 5', 3' or coding region of mRNA, antisense 
15 nucleic acids should be at least six nucleotides in length, and are preferably less that 
about 100 and more preferably less than about 50, 25, 17 or 10 nucleotides in length. 

It is preferred that in vitro studies are first performed to quantitate the ability 
of the antisense oligonucleotide to inhibit gene expression. It is preferred that these 
studies utilize controls that distinguish between antisense gene inhibition and 
20 nonspecific biological effects of oligonucleotides. It is also preferred that these 
studies compare levels of the target RNA or protein with that of an internal control 
RNA or protein. Results obtained using the antisense oligonucleotide may be 
compared with those obtained using a control oligonucleotide. It is preferred that 
the control oligonucleotide is of approximately the same length as the test 
25 oligonucleotide and that the nucleotide sequence of the oligonucleotide differs from 
the antisense sequence no more than is necessary to prevent specific hybridization to 

the target sequence. 

The antisense oligonucleotides can be DNA or RNA or chimeric mixtures or 
derivatives or modified versions thereof, single-stranded or double-stranded. T he 
30 oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate 
backbone, for example, to improve stability of the molecule, hybridization, etc. The 
oligonucleotide may include other appended groups such as peptides (e.g., for 
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targeting host cell receptors), or agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553- 
6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. 
W088/09810, published December 15, 1988) or the blood- brain barrier (see, e.g., 
5 PCT Publication No. W089/10134, published April 25, 1988), hybridization- 
triggered cleavage agents. (See, e.g., Krol et al., 1988, BioTechniques 6:958- 976) 
or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, 
the oligonucleotide may be conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 

10 cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base 
moiety which is selected from the group including but not limited to 5-fluorouracil, 
5- bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4- 
acetylcytosine, 5- (carboxyhydroxytiethyl) uracil, 5-carboxymethylaminomethyl-2- 

15 thiouridine, 5- carboxymethylaminomethyluracil, dihydrouracil, beta-D- 
galactosylqueosine, inosine, N6- isopentenyladenine, 1-methylguanine, 1- 
methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3- 
methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-memoxyaminomethyl-2-thiouracil, beta-D- 

20 mannosylqueosine, 5'-me1hoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6- isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5- 
methyl-2-thiouracil, 3-(3-amino-3- N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 

25 diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar 
moiety selected from the group including but not limited to arabinose, 2- 
fluoroarabinose, xylulose, and hexose. 

The antisense oligonucleotide can also contain a neutral peptide-like 
30 backbone. Such molecules are termed peptide nucleic acid (PNA)-oligomers and 
are described, e.g., in Perry-OTCeefe et al. (1996) Proc. Natl. Acad. Sci. U.S.A. 
93:14670 and in Eglom et al. (1993) Nature 365:566. One advantage of PNA 
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oligomers is their capability to bind to complementary DNA essentially 
independently from the ionic strength of the medium due to the neutral backbone of 
the DNA. In yet another embodiment, the antisense oligonucleotide comprises at 
least one modified phosphate backbone selected from the group consisting of a 
5 phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a 
phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl 
phosphotriester, and a formacetal or analog thereof. 

In yet a further embodiment, the antisense oligonucleotide is an alpha- 
anomeric oligonucleotide. An alpha-anomeric oligonucleotide forms specific 

10 double-stranded hybrids with complementary RNA in which, contrary to the usual 
antiparallel orientation, the strands run parallel to each other (Gautier et al., 1987, 
Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a ^-O-methylribonucleotide 
(Inoue et al., 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
analogue (Inoue et al., 1987, FEBS Lett. 215:327-330). 

15 While antisense nucleotides complementary to the coding region of a POSH ' 

or POSH-AP mRNA sequence can be used, those complementary to the transcribed 
untranslated region may also be used. 

In certain instances, it may be difficult to achieve intracellular concentrations 
of the antisense sufficient to suppress translation on endogenous mRNAs. Therefore 

20 a preferred approach utilizes a recombinant DNA construct in which the antisense 
oligonucleotide is placed under the control of a strong pol III or polll promoter. 
The use of such a construct to transfect target cells will result in the transcription of 
sufficient amounts of single stranded RNAs that will form complementary base pairs 
with the endogenous potential drug target transcripts and thereby prevent translation. 

25 For example, a vector can be introduced such that it is taken up by a cell and directs 
the transcription of an antisense RNA. Such a vector can remain episomal or 
become chromosomally integrated, as long as it can be transcribed to produce the 
desired antisense RNA. Such vectors can be constructed by recombinant DNA 
technology methods standard in the art. Vectors can be plasmid, viral, or others 

30 known in the art, used for replication and expression in mammalian cells. 
Expression o f the s equence encoding the antisense RNA c an b e b y any promoter 
known in the art to act in mammalian, preferably human cells. Such promoters can 
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be inducible or constitutive. Such promoters include but are not limited to: the 
S V40 e arly p romoter r egion ( Bernoist a nd C hambon, 1 98 1 , N ature 2 90:304-3 1 0), 
the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter 
5 (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory 
sequences of the metallothionein gene (Brinster et al, 1982, Nature 296:39-42), etc. 
Any type of plasmid, cosmid, YAC or viral vector can be used to prepare the 
recombinant DNA construct, which can be introduced directly into the tissue site. 

Alternatively, POSH or POSH-AP gene expression can be reduced by 
10 targeting deoxyribonucleotide sequences complementary to the regulatory region of 
the gene (i.e., the promoter and/or enhancers) to form triple helical structures that 
prevent transcription of the gene in target cells in the body. (See generally, Helene, 
C. 1991, Anticancer Drug Des., 6(6):569-84; Helene, C, et al., 1992, Ann. N.Y. 
Acad. Sci., 660:27-36; and Maher, L.J., 1992, Bioassays 14(12):807-15). 
15 Nucleic acid molecules to be used in triple helix formation for the inhibition 

of transcription are preferably single stranded and composed of 
deoxyribonucleotides. The base composition of these oligonucleotides should 
promote triple helix formation via Hoogsteen base pairing rules, which generally 
require sizable stretches of either purines or pyrimidines to be present on one strand 
20 of a duplex. Nucleotide sequences may be pyrimidine-based, which will result in 
TAT and CGC triplets across the three associated strands of the resulting triple 
helix. The pyrimidine-rich molecules provide base complementarity to a purine-rich 
region of a single strand of the duplex in a parallel orientation to that strand. In 
addition, nucleic acid molecules may be chosen that are purine- rich, for example, 
25 containing a stretch of G residues. These molecules will form a triple helix with a 
DNA duplex that is rich in GC pairs, in which the majority of the purine residues are 
located on a single strand of the targeted duplex, resulting in CGC triplets across the 
three strands in the triplex. 

Alternatively, POSH or POSH-AP sequences that can be targeted for triple 
30 helix formation may be increased by creating a so called "switchback" nucleic acid 
molecule. Switchback molecules are synthesized in an alternating 5'-3', 3*-5* 
manner, such that they base pair with first one strand of a duplex and then the other, 
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eliminating the necessity for a sizable stretch of either purines or pyrimidines to be 
present on one strand of a duplex. 

A further aspect of the application relates to the use of DNA enzymes to 
inhibit expression of a POSH or POSH-AP gene. DNA enzymes incorporate some 
5 of the mechanistic features of both antisense and ribozyme technologies. DNA 
enzymes are designed so that they recognize a particular target nucleic acid 
sequence, much like an antisense oligonucleotide, however much like a ribozyme 
they are catalytic and specifically cleave the target nucleic acid. 

There are currently two basic types of DNA enzymes, and both of these were 
10 identified by Santoro and Joyce (see, for example, US Patent No. 6110462). The 
10-23 DNA enzyme comprises a loop structure which connect two arms. The two 
arms provide specificity by recognizing the particular target nucleic acid sequence 
while the loop structure provides catalytic function under physiological conditions. 

Briefly, to design an ideal DNA enzyme that specifically recognizes and 
15 cleaves a target nucleic acid, one of skill in the art must first identify the unique 
target sequence. This can be done using the same approach as outlined for antisense 
oligonucleotides. Preferably, the unique or substantially sequence is a G/C rich of 
approximately 18 to 22 nucleotides. High G/C content helps insure a stronger 
interaction between the DNA enzyme and the target sequence. 
20 When synthesizing the DNA enzyme, the specific antisense recognition 

sequence that will target the enzyme to the message is divided so that it comprises 
the two arms of the DNA enzyme, and the DNA enzyme loop is placed between the 
two specific arms. 

Methods of making and administering DNA enzymes can be found, for 
25 example, in US 61 10462. Similarly, methods of delivery DNA ribozymes in vitro or 
in vivo include methods of delivery RNA ribozyme, as outlined in detail above. 
Additionally, one of skill in the art will recognize that, like antisense 
oligonucleotide, DNA enzymes can be optionally modified to improve stability and 
improve resistance to degradation. 
30 Antisense RNA and DNA, ribozyme, RNAi and triple helix molecules of the 

application may be prepared by any method known in the art for the synthesis of 
tynta smrf RNA molecules. These include techniques for chemically synthesizing 
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oligodeoxyribonucleotides and oligoribonucleotides well known in the art such as 
for example solid phase phosphoramidite chemical synthesis. Alternatively, RNA 
molecules may be generated by in vitro and in vivo transcription of DNA sequences 
encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
into a wide variety of vectors which incorporate suitable RNA polymerase 
promoters s uch astheT7orSP6p olymerase p romoters. A Itematively, antisense 
cDNA constructs that synthesize antisense RNA constitutively or inducibly, 
depending on the promoter used, can be introduced stably into cell lines. Moreover, 
various well-known modifications to nucleic acid molecules may be introduced as a 
means of increasing intracellular stability and half-life. Possible modifications 
include but are not limited to the addition of flanking sequences of ribonucleotides 
or deoxyribonucleotides to the 5' and/or 3' ends of the molecule or the use of 
phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within the 
oligodeoxyribonucleotide backbone. 

6. Drug Screening Assays 

In certain aspects, the present application provides assays for identifying 
therapeutic agents which either interfere with or promote POSH or POSH-AP 
function. In certain aspects, the present application also provides assays for 
identifying therapeutic agents which either interfere with or promote the complex 
formation between a POSH polypeptide and a POSH-AP polypeptide. 

In certain embodiments, agents of the application are antiviral agents, 
optionally interfering with viral maturation, and preferably where the virus is an 
envelope virus, and optionally a retroid virus or an RNA virus. In other 
embodiments, agents of the application are anticancer agents. In further 
embodiments, agents of the application inhibit the progression of a 
neurodegenerative disorder. In certain embodiments, an antiviral or anticancer agent 
or an agent that inhibits the progression of a neurodegenerative disorder interferes 
with the ubiquitin ligase catalytic activity of POSH (e.g., POSH auto-ubiquitination 
or transfer to a target protein). In other embodiments, agents disclosed herein inhibit 
or promote POSH and POSH-AP mediated cellular processes such as apoptosis and 
protein processing in the secretory pathway. 
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In certain preferred embodiments, an antiviral agent interferes with the 
interaction b etween P OSH and a P OSH-AP p olypeptide, for example an antiviral 
agent may disrupt or render irreversible interaction between a POSH polypeptide 
and POSH-AP polypeptide (as in the case of a POSH dimer, a heterodimer of two 
5 different POSH polypeptides, homomultimers and heteromultimers). In further 
embodiments, agents of the application are anti-apoptotic agents, optionally 
interfering with JNK and/or NF-kB signaling. In yet additional embodiments, 
agents of the application interfere with the signaling of a GTPase, such as Rac or 
Ras, optionally disrupting the interaction between a POSH polypeptide and a Rac 
10 protein. In certain embodiments, agents of the application modulate the ubiquitin 
ligase activity of POSH and may be used to treat certain diseases related to ubiquitin 
ligase activity. In certain embodiments, agents of the application interfere with the 
trafficking of a protein through the secretory pathway. 

In certain embodiments, the application provides assays to identify, optimize 
15 or otherwise assess agents that increase or decrease a ubiquitin-related activity of a 
POSH polypeptide. Ubiquitin-related activities of POSH polypeptides may include 
the self-ubiquitination activity of a POSH polypeptide, generally involving the 
transfer of ubiquitin from an E2 enzyme to the POSH polypeptide, and the 
ubiquitination of a target protein, generally involving the transfer of a ubiquitin from 
20 a POSH polypeptide to the target protein. In certain embodiments, a POSH activity 
is mediated, at least in part, by a POSH RING domain. 

In certain embodiments, an assay comprises forming a mixture comprising a 
POSH polypeptide, an E2 polypeptide and a source of ubiquitin (which may be the 
E2 polypeptide pre-complexed with ubiquitin). Optionally the mixture comprises an 
25 El polypeptide and optionally the mixture comprises a target polypeptide. 
Additional components of the mixture may be selected to provide conditions 
consistent with the ubiquitination of the POSH polypeptide. One or more of a 
variety of parameters may be detected, such as POSH-ubiquitin conjugates, E2- 
ubiquitin thioesters, free ubiquitin and target polypeptide-ubiquitin complexes. The 
30 term "detect" is used herein to include a determination of the presence or absence of 
the subject of detection (e.g., POSH-ubiqutin, E2-ubiquitin, etc.), a quantitative 
measure of the amount of the subject of detection, or a mathematical calculation of 



-79- 



WO 2004/078130 



PCTYUS2004/006308 



the presence, absence or amount of the subject of detection, based on the detection 
of other parameters. The term "detect" includes the situation wherein the subject of 
detection is determined to be absent or below the level of sensitivity. Detection may 
comprise detection of a label (e.g., fluorescent label, radioisotope label, and other 
5 described below), resolution and identification by size (e.g., SDS-PAGE, mass 
spectroscopy), purification and detection, and other methods that, in view of this 
specification, will be available to one of skill in the art. For instance, radioisotope 
labeling may be measured by scintillation counting, or by densitometry after 
exposure to a photographic emulsion, or by using a device such as a 
10 Phosphorimager. Likewise, densitometry may be used to measure bound ubiquitin 
following a reaction with an enzyme label substrate that produces an opaque product 
when an enzyme label is used. In a preferred embodiment, an assay comprises 
detecting the POSH-ubiquitin conjugate. 

In certain embodiments, an assay comprises forming a mixture comprising a 
15 POSH polypeptide, a target polypeptide and a source of ubiquitin (which may be the 
POSH polypeptide pre-complexed with ubiquitin). Optionally the mixture 
comprises an El and/or E2 polypeptide and optionally the mixture comprises an E2- 
ubiquitin thioester. Additional components of the mixture may be selected to 
provide conditions consistent with the ubiquitination of the target polypeptide. One 
20 or more of a variety of parameters may be detected, such as POSH-ubiquitin 
conjugates and target polypeptide-ubiquitin conjugates. In a preferred embodiment, 
an assay comprises detecting the target polypeptide-ubiquitin conjugate. In another 
preferred embodiment, an assay comprises detecting the POSH-ubiquitin conjugate. 
An assay described above may be used in a screening assay to identify agents 
25 that modulate a ubiquitin-related activity of a POSH polypeptide. A screening assay 
will generally involve adding a test agent to one of the above assays, or any other 
assay designed to assess a ubiquitin-related activity of a POSH polypeptide. The 
parameter(s) detected in a screening assay may be compared to a suitable reference. 
A suitable reference may be an assay run previously, in parallel or later that omits 
30 the test agent. A suitable reference may also be an average of previous 
measurements in the absence of the test agent. In general the components of a 
screening assay mixture may be added in any order consistent with the overall 
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activity to be assessed, but certain variations may be preferred. For example, in 
certain embodiments, it may be desirable to pre-incubate the test agent and the E3 
(e.g., the POSH polypeptide), followed by removing the test agent and addition of 
other components to complete the assay. In this manner, the effects of the agent 
5 solely on the POSH polypeptide may be assessed. In certain preferred 
embodiments, a screening assay for an antiviral agent employs a target polypeptide 
comprising an L domain, and preferably an HIV L domain. 

In certain embodiments, an assay is performed in a high-throughput format. 
For example, one of the components of a mixture may be affixed to a solid substrate 

10 and one or more of the other components is labeled. For example, the POSH 
polypeptide may be affixed to a surface, such as a 96-well plate, and the ubiquitin is 
in solution and labeled. An E2 and El are also in solution, and the POSH-ubiquitin 
conjugate formation may be measured by washing the solid surface to remove 
uncomplexed labeled ubiquitin and detecting the ubiquitin that remains bound. 

15 Other variations may be used. For example, the amount of ubiquitin in solution may 
be detected. In certain embodiments, the formation of ubiquitin complexes may be 
measured by an interactive technique, such as FRET, wherein a ubiquitin is labeled 
with a first label and the desired complex partner (e.g., POSH polypeptide or target 
polypeptide) is labeled with a second label, wherein the first and second label 

20 interact when they come into close proximity to produce an altered signal. In 
FRET, the first and second labels are fluorophores. FRET is described in greater 
detail below. The formation of polyubiquitin complexes may be performed by 
mixing two or more pools of differentially labeled ubiquitin that interact upon 
formation of a polyubiqutin (see, e.g., US Patent Publication 20020042083). High- 

25 throughput may be achieved by performing an interactive assay, such as FRET, in 
solution a s w ell. In a ddition, i f a p olypeptide in t he m ixture, s uch a s the P OSH 
polypeptide or target polypeptide, is readily purifiable (e.g., with a specific antibody 
or via a tag such as biotin, FLAG, polyhistidine, etc.), the reaction may be 
performed in solution and the tagged polypeptide rapidly isolated, along with any 

30 polypeptides, such as ubiquitin, that are associated with the tagged polypeptide. 
Proteins may also be resolved by SDS-PAGE for detection. 
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In certain embodiments, the ubiquitin is labeled, either directly or indirectly. 
This typically allows for easy and rapid detection and measurement of ligated 
ubiquitin, m aking the assay useful for high-throughput screening applications. As 
descrived above, certain embodiments may employ one or more tagged or labeled 
5 proteins. A "tag" is meant to include moieties that facilitate rapid isolation of the 
tagged polypeptide. A tag may be used to facilitate attachment of a polypeptide to a 
surface. A "label" is meant to include moieties that facilitate rapid detection of the 
labeled polypeptide. Certain moieties may be used both as a label and a tag (e.g., 
epitope tags that are readily purified and detected with a well-characterized 
10 antibody). Biotinylation of polypeptides is well known, for example, a large number 
of biotinylation agents are known, including amine-reactive and thiol-reactive 
agents, for the biotinylation of proteins, nucleic acids, carbohydrates, carboxylic 
acids; see chapter 4, Molecular Probes Catalog, Haugland, 6th Ed. 1996, hereby 
incorporated by reference. A biotinylated substrate can be attached to a biotinylated 
15 component via avidin or streptavidin. Similarly, a large number of haptenylation 
reagents are also known. 

An "El" is a ubiquitin activating enzyme. In a preferred embodiment, El is 
capable of transferring ubiquitin to an E2. In a preferred embodiment, El forms a 
high energy thiolester bond with ubiquitin, thereby "activating" the ubiquitin. An 
20 "E2" is a ubiquitin carrier enzyme (also known as a ubiquitin conjugating enzyme). 
In a preferred embodiment, ubiquitin is transferred from El to E2. In a preferred 
embodiment, the transfer results in a thiolester bond formed between E2 and 
ubiquitin. In a preferred embodiment, E2 i s c apable o f transferring ubiquitin to a 
POSH polypeptide. 

25 In an alternative embodiment, a POSH polypeptide, E2 or target polypeptide 

is bound to a bead, optionally with the assistance of a tag. Following ligation, the 
beads may be separated from the unbound ubiquitin and the bound ubiquitin 
measured. In a preferred embodiment, POSH polypeptide is bound to .beads and the 
composition used includes labeled ubiquitin. In this embodiment, the beads with 

30 bound ubiquitin may be separated using a fluorescence-activated cell sorting 
(FACS) machine. Methods for such use are described in U.S. patent application Ser. 



-82- 



WO 2004/078130 



PCT7US2004/006308 



No. 09/047,119, which is hereby incorporated in its entirety. The amount of bound 
ubiquitin can then be measured. 

In a screening assay, the effect of a test agent may be assessed by, for 
example, assessing the effect of the test agent on kinetics, steady-state and/or 
5 endpoint of the reaction. 

The components of the various assay mixtures provided herein may be 
combined in varying amounts. In a preferred embodiment, ubiquitin (or E2 
complexed ubiquitin) is combined at a final concentration of from 5 to 200 ng per 
100 microliter reaction solution. Optionally El is used at a final concentration of 

10 from 1 to 50 ng per 100 microliter reaction solution. Optionally E2 is combined at a 
final concentration of 10 to 100 ng per 100 microliter reaction solution, more 
preferably 10-50 ng per 100 microliter reaction solution. In a preferred embodiment, 
POSH polypeptide is combined at a final concentration of from 1 to 500 ngper 100 
microliter reaction solution. 

15 Generally, an assay mixture is prepared so as to favor ubiquitin ligase 

activity and/or ubiquitination activity. Generally, this will be physiological 
conditions, such as 50 - 200 mM salt (e.g., NaCl, KC1), pH of between 5 and 9, and 
preferably between 6 and 8. Such conditions may be optimized through trial and 
error. Incubations m ay be p erformed a t any t emperature which f acilitates o ptimal 

20 activity, typically between 4 and 40 °C. Incubation periods are selected for optimum 
activity, but may also be optimized to facilitate rapid high through put s creening. 
Typically between 0.5 and 1.5 hours will be sufficient. A variety of other reagents 
may be included in the compositions. These include reagents like salts, solvents, 
buffers, neutral proteins, e.g., albumin, detergents, etc. which may be used to 

25 facilitate optimal ubiquitination enzyme activity and/or reduce non-specific or 
background interactions. Also reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., 
may be used. The compositions will also preferably include adenosine tri-phosphate 
(ATP). The mixture of components may be added in any order that promotes 

30 ubiquitin ligase activity or optimizes identification of candidate modulator effects. In 
a preferred embodiment, ubiquitin is provided in a reaction buffer solution, followed 
by addition of the ubiquitination enzymes. In an alternate preferred embodiment, 
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ubiquitin is provided in a reaction buffer solution, a candidate modulator is then 
added, followed by addition of the ubiquitination enzymes. 

In general, a test agent that decreases a POSH ubiquitin-related activity may 
be used to inhibit POSH function in vivo, while a test agent that increases a POSH 
5 ubiquitin-related activity may be used to stimulate POSH function in vivo. Test 
agent may be modified for use in vivo, e.g., by addition of a hydrophobic moiety, 
such as an ester. 

In certain embodiments, a ubiquitination assay as described above for POSH 
can similarly be conducted for a Cbl-b, a SIAH1, or a TTC3 polypeptide. In certain 

10 embodiments, the application provides assays to identify, optimize or otherwise 
assess agents that increase or decrease a ubiquitin-related activity of a Cbl-b, a 
SIAH1, or a TTC3 polypeptide. Ubiquitin-related activities of Cbl-b, SIAH1, or 
TTC3 polypeptides may include the self-ubiquitination activity of a Cbl-b, SIAH1, 
or TTC3 polypeptide, generally involving the transfer of ubiquitin from an E2 

15 enzyme to the respective Cbl-b, SIAH1, or TTC3 polypeptide, and the ubiquitination 
of a target protein, e.g., the p85 subunit of PI3K, e.g, synaptophysin, generally 
involving the transfer of a ubiquitin from a Cbl-b, SIAH1, or TTC3 polypeptide to 
the target protein, e.g, the p85 subunit of PI3K, e.g., synaptophysin, e.g., 
HERPUD1. In certain embodiments, a Cbl-b, a SIAH1, or a TTC3 activity is 

20 mediated, at least in part, by a RING domain of a Cbl-b, a SIAH1, or a TTC3, 
respectively. 

An additional POSH-AP may be added to a POSH ubiquitination assay to 
assess the effect of the POSH-AP (e.g., PRKAR1 A, PRKACA, or PRKACB) on 
POSH-mediated ubiquitination and/or to assess whether the POSH-AP is a target for 
25 POSH-mediated ubiquitination (e.g., HERPUD1 , e.g., PKA). 

Certain embodiments of the application relate to assays for identifying agents 
that b ind t o a P OSH o r P OSH-AP p olypeptide, o ptionally a p articular domain o f 
POSH such as an SH3 or RING domain or a particular domain of a POSH-AP, 
particularly a kinase catalytic domain or ATP binding domain. In preferred 
30 embodiments, a POSH polypeptide is a polypeptide comprising the fourth SH3 
domain of hPOSH (SEQ ID NO: 30). A wide variety of assays may be used for this 
purpose, including labeled in vitro protein-protein binding assays, electrophoretic 
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mobility shift assays, immunoassays for protein binding, and the like. The purified 
protein may also be used for determination of three-dimensional crystal structure, 
which can be used for modeling intermolecular interactions and design of test 
agents. In one embodiment, an assay detects agents which inhibit interaction of one 
5 or more subject POSH polypeptides with a POSH-AP. In another embodiment, the 
assay detects agents which modulate the intrinsic biological activity of a POSH 
polypeptide or POSH complex, such as an enzymatic activity, binding to other 
cellular components, cellular compartmentalization, and the like. 

Certain embodiments of the application relate to assays for identifying agents 
10 that modulate a POSH-AP polypeptide such as a PKA subunit polypeptide. 
Preferred PKA subunit polypeptides include PRKAR1 A, PRKACA, and PRKACB. 
Exemplary assays used for this purpose may include detecting phosphorylation of 
PKA subunit, kinase activity of the PKA subunit, ability of the PKA subunit to elicit 
downstream signaling of the PKA pathway, and the like. For example, activity of 
15 protein kinase A can be assayed either in vitro or in vivo. PKA activity can be 
determined b y d etecting p osphorylation o f a P KA specific s ubstrate. T he s pecific 
PKA substrate can be any convenient peptide with a serine that is recognized as a 
phosphorylation site by PKA. For example, the peptide substrate can have the 
sequence: Leu-Arg-Arg-Ala- Ser-Leii-Gly. 
20 * In one aspect, the application provides methods and compositions for the 

identification of compositions that interfere with the function of POSH or POSH-AP 
polypeptides. Given the role of POSH polypeptides in viral production, 
compositions that perturb the formation or stability of the protein-protein 
interactions between POSH polypeptides and the proteins that they interact with, 
25 such as POSH-APs, and particularly POSH complexes comprising a viral protein, 
are candidate pharmaceuticals for the treatment of viral infections. 

While not wishing to be bound to mechanism, it is postulated that POSH 
polypeptides promote the assembly of protein complexes that are important in 
release of virions and other biological processes. Complexes of the application may 
30 include a combination of a POSH polypeptide and a POSH-AP. Exemplary 
complexes may comprise one or more of the following: a POSH polypeptide (as in 
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the case of a POSH dimer, a heterodimer of two different POSH, homomultimers 
and heteromultimers); a HERPUD1 polypeptide; or an MSTP028 polypeptide. 

In an assay for an antiviral or antiapoptotic agent, the test agent is assessed 
for its ability to disrupt or inhibit the formation of a complex of a POSH polypeptide 
5 and a small GTPase, such as a Rac polypeptide, particularly a human Rac 
polypeptide, such as Racl. 

A variety of assay formats will suffice and, in light of the present disclosure, 
those not expressly described herein will nevertheless be comprehended by one of 
ordinary skill in the art. Assay formats which approximate such conditions as 
10 formation of protein complexes, enzymatic activity, and even a POSH polypeptide- 
mediated membrane reorganization or vesicle formation activity, may be generated 
in many different forms, and include assays based on cell-free systems, e.g., purified 
proteins or cell lysates, as well as cell-based assays which utilize intact cells. 
Simple binding assays can also be used to detect agents which bind to POSH. Such 
15 binding assays may also identify agents that act by disrupting the interaction 
between a POSH polypeptide and a POSH interacting protein, or the binding of a 
POSH polypeptide or complex to a substrate. Agents to be tested can be produced, 
for example, by bacteria, yeast or other organisms (e.g., natural products), produced 
chemically (e.g., small molecules, including peptidomimetics), or produced 
20 recombinantly. In a preferred embodiment, the test agent is a small organic 
molecule, e.g., other than a peptide or oligonucleotide, having a molecular weight of 
less than about 2,000 daltons. 

In many drug screening programs which test libraries of compounds and 
natural extracts, high throughput assays are desirable in order to maximize the 
25 number of compounds surveyed in a given period of time. Assays of the present 
application which are performed in cell-free systems, such as may be developed with 
purified or semi-purified proteins or with lysates, are often preferred as "primary" 
screens in that they can be generated to permit rapid development and relatively easy 
detection of an alteration in a molecular target which is mediated by a test 
30 compound. Moreover, the effects of cellular toxicity and/or bioavailability of the 
test compound can be generally ignored in the in vitro system, the assay instead 
being focused primarily on the effect of the drug on the molecular target as may be 
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manifest in an alteration of binding affinity with other proteins or changes in 
enzymatic properties of the molecular target. 

In preferred in vitro embodiments of the present assay, a reconstituted POSH 
complex c omprises a reconstituted mixture o fat 1 east s emi-purified proteins. B y 
5 semi-purified, it is meant that the proteins utilized in the reconstituted mixture have 
been previously separated from other cellular or viral proteins. For instance, in 
contrast to cell lysates, the proteins involved in POSH complex formation are 
present in the mixture to at least 50% purity relative to all other proteins in the 
mixture, and more preferably are present at 90-95% purity. In certain embodiments 
10 of the subject method, the reconstituted protein mixture is derived by mixing highly 
purified proteins such that the reconstituted mixture substantially lacks other 
proteins (such as of cellular or viral origin) which might interfere with or otherwise 
alter the ability to measure POSH complex assembly and/or disassembly. 

Assaying POSH complexes, in the presence and absence of a candidate 
15 inhibitor, can be accomplished in any vessel suitable for containing the reactants. 
Examples include microtitre plates, test tubes, and micro-centrifuge tubes. 

In one embodiment of the present application, drug screening assays can be 
generated which detect inhibitory agents on the basis of their ability to interfere with 
assembly or s tability o f the P OSH c omplex. In a n exemplary binding assay, t he 
20 compound of interest is contacted with a mixture comprising a POSH polypeptide 
and at least one interacting polypeptide. Detection and quantification of POSH 
complexes provides a means for determining the compound's efficacy at inhibiting 
(or potentiating) interaction between the two polypeptides. The efficacy of the 
compound can be assessed by generating dose response curves from data obtained 
25 using various concentrations of the test compound. Moreover, a control assay can 
also be performed to provide a baseline for comparison. In the control assay, the 
formation of complexes is quantitated in the absence of the test compound. 

Complex formation between the POSH polypeptides and a substrate 
polypeptide may be detected by a variety of techniques, many of which are 
30 effectively described above. For instance, modulation in the formation of complexes 
can be quantitated using, for example, detectably labeled proteins (e.g., radiolabeled, 
fluorescently labeled, or enzymatically labeled), by immunoassay, or by 
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chromatographic detection. Surface plasmon resonance systems, such as those 
available from B iacore International AB (Uppsala, S weden), m ay also b e u sed t o 
detect protein-protein interaction 

Often, it will be desirable to immobilize one of the polypeptides to facilitate 
5 separation of complexes from uncomplexed forms of one of the proteins, as well as 
to accommodate automation of the assay. In an illustrative embodiment, a fusion 
protein can be provided which adds a domain that permits the protein to be bound to 
an insoluble matrix. For example, GST-POSH fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
10 derivatized microtitre plates, which are then combined with a potential interacting 
protein, e.g., an i5 S-labeled polypeptide, and the test compound and incubated under 
conditions c onducive t o c omplex formation. F ollowing i ncubation, the b eads are 
washed to remove any unbound interacting protein, and the matrix bead-bound 
radiolabel determined directly (e.g., beads placed in scintillant), or in the supernatant 
15 after the complexes are dissociated, e.g., when microtitre plate is used. 
Alternatively, after washing away unbound protein, the complexes can be 
dissociated from the matrix, separated by SDS-PAGE gel, and the level of 
interacting polypeptide found in the matrix-bound fraction quantitated from the gel 
using standard electrophoretic techniques. 
20 In a further embodiment, agents that bind to a POSH or POSH-AP may be 

identified by using an immobilized POSH or POSH-AP. In an illustrative 
embodiment, a fusion protein can be provided which adds a domain that permits the 
protein to be bound to an insoluble matrix. For example, GST-POSH fusion 
proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. 
25 Louis, MO) or glutathione derivatized microtitre plates, which are then combined 
with a potential labeled binding agent and incubated under conditions conducive to 
binding. Following incubation, the beads are washed to remove any unbound agent, 
and the matrix bead-bound label determined directly, or in the supernatant after the 
bound agent is dissociated. 
30 In yet another embodiment, the POSH polypeptide and potential interacting 

polypeptide can be used to generate an interaction trap assay (see also, U.S. Patent 
NO: 5.283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol 
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Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; and 
Iwabuchi et al. ( 1993) Oncogene 8 :1693-1696), f or s ubsequently d etecting agents 
which disrupt binding of the proteins to one and other. 

In particular, the method makes use of chimeric genes which express hybrid 
5 proteins. To illustrate, a first hybrid gene comprises the coding sequence for a 
DNA-binding domain of a transcriptional activator can be fused in frame to the 
coding sequence for a "bait" protein, e.g., a POSH polypeptide of sufficient length to 
bind to a potential interacting protein. The second hybrid protein encodes a 
transcriptional activation domain fused in frame to a gene encoding a "fish" protein, 
10 e.g. , a p otential i nteracting p rotein o f s ufficient length t o i nteract with t he P OSH 
polypeptide portion of the bait fusion protein. If the bait and fish proteins are able to 
interact, e.g., form a POSH complex, they bring into close proximity the two 
domains o f t he t ranscriptional a ctivator. T his p roximity c auses t ranscription o f a 
reporter gene which is operably linked to a transcriptional regulatory site responsive 
15 to the transcriptional activator, and expression of the reporter gene can be detected 
and used to score for the interaction of the bait and fish proteins. 

One aspect of the present application provides reconstituted protein 
preparations including a POSH polypeptide and one or more interacting 
polypeptides. 

20 In still further embodiments of the present assay, the POSH complex is 

generated in whole cells, taking advantage of cell culture techniques to support the 
subject assay. For example, as described below, the POSH complex can be 
constituted in a eukaryotic cell culture system, including mammalian and yeast cells. 
Often it will be desirable to express one or more viral proteins (e.g., Gag or Env) in 

25 such a cell along with a subject POSH polypeptide. It may also be desirable to 
infect the cell with a virus of interest. Advantages to generating the subject assay in 
an intact cell include the ability to detect inhibitors which are functional in an 
environment more closely approximating that which therapeutic use of the inhibitor 
would require, including the ability of the agent to gain entry into the cell. 

30 Furthermore, certain of the in vivo embodiments of the assay, such as examples 
given below, are amenable to high through-put analysis of candidate agents. 
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The components of the POSH complex can be endogenous to the cell 
selected to support the assay. Alternatively, some or all of the components can be 
derived from exogenous sources. For instance, fusion proteins can be introduced 
into the cell by recombinant techniques (such as through the use of an expression 
5 vector), as well as by microinjecting the fusion protein itself or mRNA encoding the 
fusion protein. 

In many embodiments, a cell is manipulated after incubation with a 
candidate agent and assayed for a POSH or POSH-AP activity. In certain 
embodiments, a POSH-AP, such as PTPN12, is a tyrosine phosphatase. Tyrosine 

10 phosphatase activity may be assessed by incubating a cell lysate, which has 

optionally been treated with pervanadate to stimulate tyrosine phosphorylation, with 
a POSH-AP that has tyrosine phosphatase activity, immunoprecipitating the 
substrate protein and immunoblotting for the presence of phosphorylated tyrosine. 
Alternatively, tyrosine phosphatase activity may be assessed by the substrate 

15 trapping method. The substrate trapping method employs catalytically inactive 
mutants of a tyrosine phosphatase (e.g., a POSH-AP such as PTPN12). The 
catalytically inactive phosphatase mutant is immobilized on a solid matrix (e.g., 
AG25-protein A-Sepharose beads) and incubated with a substrate protein. The solid 
matrix to which the catalytically inactive phosphatase is bound is isolated and 

20 subjected to SDS-PAGE and immunoblotting for the presence of the substrate 

protein. The proteins employed in a phosphatase assay may optionally be purified 
proteins. (Lyons, PD et al (2001) J Biol Chem 246:24422-31; Garton, AJ et al 
(1996) Mol Cell Biol 16:6408-18). 

In many embodiments, a cell is manipulated after incubation with a candidate agent 
25 and assayed for a POSH or POSH-AP activity. In certain embodiments a POSH or 
POSH-AP activity is represented by production of virus like particles. As 
demonstrated herein, an agent that disrupts POSH or POSH-AP activity can cause a 
decrease in the production of virus like particles. Other bioassays for POSH or 
POSH-AP activities may include apoptosis assays (e.g., cell survival assays, 
30 apoptosis reporter gene assays, etc.) and NF-kB nuclear localization assays (see e.g., 
Tapon et al. (1998) EMBO J. 17: 1395-1404). One apoptosis assay that may be used 
to assess TGN-associated protein activity is the TUNEL assay, which is used to 
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detect the presence of apoptotic cell death. In the TUNEL assay, the enzyme 
terminal deoxynucleotidyl transferase labels 3 '-OH DNA ends (which are generated 
during apoptosis) with biotinylated nucleotides. The biotinylated nucleotides are 
then detected by immunoperoxidase staining. Another apoptosis assay that may be 
5 used to assess TGN-associated protein activity is the caspase assay, in which caspase 
activity is measured using a blue fluorescent substrate. Cleavage of the substrate by 
caspase 3 releases the fluorochrome, which then fluoresces green. An assay that 
may be employed to monitor cell proliferation associated with a TGN-associated 
protein is the MTT cell proliferation assay. The MTT cell proliferation assay is a 

10 colorimetric assay which measures the reduction of a tetrazolium component (MTT) 
into an insoluble formazan product by the mitochondria of viable cells. After 
incubation of the cells with the MTT reagent, a detergent solution is added to lyse 
the cells and solubilize the colored crystals. The samples may be read using an 
ELISA plate reader. The amount of color produced is directly proportional to the 

1 5 number of viable cells. 

In certain embodiments, POSH or POSH-AP activities may include, without 
limitation, complex formation, ubiquitination and membrane fusion events (eg. 
release of viral buds or fusion of vesicles). POSH-AP activity may be assessed by 
the presence of phosphorylated substrate, such as, in the case of PKA, 

20 phosphorylated POSH. The interaction of POSH with a small GTPase such as Rac 
may also be indicative of the absence of phosphorylation of POSH by PKA. POSH 
complex formation may be assessed by immunoprecipitation and analysis of co- 
immunoprecipiated proteins or affinity purification and analysis of co-purified 
proteins. Fluorescence Resonance Energy Transfer (FRET)-based assays or other 

25 energy transfer assays may also be used to determine complex formation. 

The effect of an agent that modulates the activity of POSH or a POSH-AP 
may bee valuated f or e ffects o n t he t rafficking o f a p rotein t hrough t he s ecretory 
system. For example, the effects of the agent on the trafficking of the protein may 
be assessed by detecting the glycosylation of the protein in the presence and absence 

30 of the agent, for instance, through the use of antibodies specific for sugar moieties. 
For example, cell lysates from cells treated in the absence and presence of an agent 
that modulates the activity of POSH or a POSH-AP may be subjected to 
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immunoprecipitation and immunoblotting with antibodies directed to the 
glycoprotein of interest and the glycosylation state of the protein then compared. 

Additional bioassays for assessing POSH and POSH-AP activities may 
include assays to detect the improper processing of a protein that is associated with a 
5 neurological disorder. One assay that may be used is an assay to detect the presence, 
including an i ncrease o r a d ecrease i n t he amount, o f a p rotein associated w ith a 
neurological disorder. For example, the use of RNAi may be employed to 
knockdown the expression of a POSH or POSH-AP in cells (e.g., CHO cells or COS 
cells). The production of a secreted protein such as for example, amyloid beta, in 
10 the cell culture media, can then be assessed and compared to production of the 
secreted protein from control cells, which may be cells in which the POSH or 
POSH-AP activity has not been inhibited. The production of secreted proteins may 
be assessed, such as amyloid beta protein, which is associated with Alzheimer's 
disease. In some instances, a label may be incorporated into a secreted protein and 
15 the presence of the labeled secreted protein detected in the cell culture media. 
Proteins secreted from any cell type may be assessed, including for example, neural 
cells. 

The effect of an agent that modulates the activity of POSH or a POSH-AP 
may be evaluated for effects on mouse models of various neurological disorders. 
20 For example, mouse models of Alzheimer's disease have been described. See, for 
example, United States Patent No. 5,612,486 for "Transgenic Animals Harboring 
APP Allele Having Swedish Mutation/Tatent No. 5,850,003 (the' 003 patent) for 
'Transgenic Rodents Harboring APP Allele Having Swedish Mutation,"and United 
States Patent No. 5,455,169 entitled "Nucleic Acids for Diagnosing and Modeling. 

25 Alzheimer's Disease". Mouse models of Alzheimer's disease tend to produce 
elevated levels of beta-amyloid protein in the brain, and the increase or decrease of 
such protein in response to treatment with a test agent may be detected. In some 
instances, it may also be desirable to assess the effects of a test agent on cognitive or 
behavioral characteristics of a mouse model for Alzheimer's disease, as well as 

30 mouse models for other neurological disorders. 

In a further embodiment, transcript levels may be measured in cells having 
higher or lower levels of POSH or POSH-AP activity in order to identify genes that 
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are regulated by POSH or POSH-APs. Promoter regions for such genes (or larger 
portions of such genes) may be operatively linked to a reporter gene and used in a 
reporter gene-based assay to detect agents that enhance or diminish POSH- or 
POSH-AP-regulated gene expression. Transcript levels may be determined in any 
5 way known in the art, such as, for example, Northern blotting, RT-PCR, microarray, 
etc. Increased POSH activity may be achieved, for example, by introducing a strong 
POSH expression vector. Decreased POSH activity may be achieved, for example, 
by RNAi, antisense, ribozyme, gene knockout, etc. 

In general, where the screening assay is a binding assay (whether protein- 
10 protein binding, agent-protein binding, etc.), one or more of the molecules may be 
joined to a label, where the label can directly or indirectly provide a detectable 
signal. Various labels include radioisotopes, fluorescers, chemiluminescers, 
enzymes, specific binding molecules, particles, e.g., magnetic particles, and the like. 
Specific b inding m olecules i nclude p airs, s uch as b iotin a nd s treptavidin, d igoxin 
15 and antidigoxin etc. For the specific binding members, the complementary member 
would normally be labeled with a molecule that provides for detection, in 
accordance with known procedures. 

In further embodiments, the application provides methods for identifying 
targets for therapeutic intervention. A polypeptide that interacts with POSH or 
20 participates in a POSH-mediated process (such as viral maturation) may be used to 
identify candidate therapeutics. Such targets may be identified by identifying 
proteins that associated with POSH (POSH-APs) by, for example, 
immunoprecipitation with an anti-POSH antibody, in silico analysis of high- 
throughput binding data, two-hybrid screens, and other protein-protein interaction 
25 assays described herein or otherwise known in the art in view of this disclosure. 
Agents t hat b ind t o s uch t argets o r d isrupt p rotein-protein i nteractions t hereof, o r 
inhibit a biochemical activity thereof may be used in such an assay. Targets that 
have been identified by such approaches include POSH-APs provided in Tables 7 
and 8 and in Figure 36. 

30 A variety of other reagents may be included in the screening assay. These 

include reagents like salts, neutral proteins, e.g., albumin, detergents, etc that are 
used to facilitate optimal protein-protein binding and/or reduce nonspecific or 
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background interactions. Reagents that improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti- microbial agents, etc. maybe used. The 
mixture of components are added in any order that provides for the requisite 
binding. Incubations are performed at any suitable temperature, typically between 4 
5 °C and 40 °C. Incubation periods are selected for optimum activity, but may also be 
optimized to facilitate rapid high-throughput screening. 

In certain embodiments, a test agent may be assessed for antiviral or 
anticancer activity by assessing effects on an activity (function) of a POSH-AP. 
Activity (function) may be affected by an agent that acts at one or more of the 
10 transcriptional, translational or post-translational stages. For example, an siRNA 
directed to a POSH-AP encoding gene will decrease activity, as will a small 
molecule that interferes with a catalytic activity of a POSH-AP. In certain 
embodiments, the agent inhibits the activity of one or more polypeptides selected 
from among HERPUD1 and MSTP028. 

15 

7. Exemplary Nucleic Acids and Expression Vectors 

In certain aspects, the application relates to nucleic acids encoding POSH 
polypeptides, such as, for example, SEQ ID Nos: 2, 5, 7, 9, 11, 26, 27, 28, 29 and 
30. Nucleic acids of the application are further understood to include nucleic acids 
20 that comprise variants of SEQ ID Nos:l, 3, 4, 6, 8, 10, 31, 32, 33, 34, and 35. 
Variant nucleotide sequences include sequences that differ by one or more 
nucleotide substitutions, additions or deletions, such as allelic variants; and will, 
therefore, include coding sequences that differ from the nucleotide sequence of the 
coding sequence designated in SEQ ID Nos:l, 3, 4, 6, 8 10, 31, 32, 33, 34, and 35, 

25 e.g., due to the degeneracy of the genetic code. In other embodiments, variants will 
also include sequences that will hybridize under highly stringent conditions to a 
nucleotide sequence of a coding sequence designated in any of SEQ ID Nos:l, 3, 4, 
6, 8 10, 31, 32, 33, 34, and 35. Preferred nucleic acids of the application are human 
POSH sequences, including, for example, any of SEQ ID Nos: 1, 3, 4, 6, 31, 32, 33, 

30 34, 35 and variants thereof and nucleic acids encoding an amino acid sequence 
selected from among SEQ ID Nos: 2, 5, 7, 26, 27, 28, 29 and 30. 
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In certain aspects, the application relates to nucleic acids encoding POSH-AP 
polypeptides. For example, POSH-APs of the disclosure are listed in Table 7. 
Nucleic acid sequences encoding these POSH-APs are provided in Figure 36. 
Additional examples of POSH-APs of the disclosure are provided in Table 8. In 
5 certain embodiments, variants will also include nucleic acid sequences that will 
hybridize under highly stringent conditions to a nucleotide sequence of a coding 
sequence of a POSH-AP. Preferred nucleic acids of the application are human 
POSH-AP sequences and variants thereof. 

One of ordinary skill in the art will understand readily that appropriate 
10 stringency conditions which promote DNA hybridization can be varied. For 
example, one could perform the hybridization at 6.0 x sodium chloride/sodium 
citrate (SSC) at about 45 °C, followed by a wash of 2.0 x SSC at 50 °C. For 
example, the salt concentration in the wash step can be selected from a low 
stringency of about 2.0 x SSC at 50 °C to a high stringency of about 0.2 x SSC at 50 
15 °C. In addition, the temperature in the wash step can be increased from low 
stringency conditions at room temperature, about 22 °C, to high stringency 
conditions at about 65 °C. Both temperature and salt may be varied, or temperature 
or salt concentration may be held constant while the other variable is changed. In 
one embodiment, the application provides nucleic acids which hybridize under low 
20 stringency conditions of 6 x SSC at room temperature followed by a wash at 2 x 
SSC at room temperature. 

Isolated nucleic acids which differ from the POSH nucleic acid sequences or 
from the POSH-AP nucleic acid sequences due to degeneracy in the genetic code are 
also within the scope of the application. For example, a number of amino acids are 
25 designated by more than one triplet. Codons that specify the same amino acid, or 
synonyms (for example, CAU and CAC are synonyms for histidine) may result in 
"silent" mutations which do not affect the amino acid sequence of the protein. 
However, it is expected that DNA sequence polymorphisms that do lead to changes 
in the amino acid sequences of the subject proteins will exist among mammalian 
30 cells. One skilled in the art will appreciate that these variations in one or more 
nucleotides (up to about 3-5% of the nucleotides) of the nucleic acids encoding a 
particular protein may exist among individuals of a given species due to natural 
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allelic variation. Any and all such nucleotide variations and resulting amino acid 
polymorphisms are within the scope of this application. 

Optionally, a POSH or a POSH-AP nucleic acid of the application will 
genetically complement a partial or complete loss of function phenotype in a cell, 
5 For example, a POSH nucleic acid of the application may be expressed in a cell in 
which endogenous POSH has been reduced by RNAi, and the introduced POSH 
nucleic acid will mitigate a phenotype resulting from the RNAi. An exemplary 
POSH loss of function phenotype is a decrease in virus-like particle production in a 
cell transfected with a viral vector, optionally an HIV vector. 
10 Another aspect of the application relates to POSH and POSH-AP nucleic 

acids that are used for antisense, RNAi or ribozymes. As used herein, nucleic acid 
therapy refers to administration or in situ generation of a nucleic acid or a derivative 
thereof which specifically hybridizes (e.g., binds) under cellular conditions with the 
cellular mRNA and/or genomic DNA encoding one of the POSH or POSH-AP 
15 polypeptides so as to inhibit production of that protein, e.g., by inhibiting 
transcription and/or translation. The binding may be by conventional base pair 
complementarity, or, for example, in the case of binding to DNA duplexes, through 
specific interactions in the major groove of the double helix. 

A nucleic acid therapy construct of the present application can be delivered, 
20 for example, as an expression plasmid which, when transcribed in the cell, produces 
RNA which is complementary to at least a unique portion of the cellular mRNA 
which encodes a POSH or POSH-AP polypeptide. Alternatively, the the construct is 
an oligonucleotide which is generated ex vivo and which, when introduced into the 
cell causes inhibition of expression by hybridizing with the mRNA and/or genomic 
25 sequences encoding a POSH or POSH-AP polypeptide. Such oligonucleotide 
probes are optionally modified o ligonucleotide which are resistant to endogenous 
nucleases, e.g., exonucleases and/or endonucleases, and is therefore stable in vivo. 
Exemplary nucleic acid molecules for use as antisense oligonucleotides are 
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also 
30 U.S. Patents 5,176,996; 5,264,564; and 5,256,775). Additionally, general 
approaches to constructing oligomers useful in nucleic acid therapy have been 
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reviewed, for example, by van der Krol et al., (1988) Biotechniques 6:958-976; and 
Stein et al., (1988) Cancer Res 48:2659-2668. 

Accordingly, the modified oligomers of the application are useful in 
therapeutic, diagnostic, and research contexts. In therapeutic applications, the 
5 oligomers are utilized in a manner appropriate for nucleic acid therapy in general. 

In another aspect of the application, the subject nucleic acid is provided in an 
expression vector comprising a nucleotide sequence encoding a POSH or POSH-AP 
polypeptide and operably linked to at least one regulatory sequence. Regulatory 
sequences are art-recognized and are selected to direct expression of the POSH or 
10 POSH-AP polypeptide. Accordingly, the term regulatory sequence includes 
promoters, enhancers and other expression control elements. Exemplary regulatory 
sequences are described in Goeddel; Gene Expression Technology. Methods in 
Enzymology, Academic Press, San Diego, CA (1990). For instance, any of a wide 
variety of expression control sequences that control the expression of a DNA 
1 5 sequence when operatively linked to it may be used in these vectors to express DNA 
sequences encoding a POSH or POSH-AP polypeptide. Such useful expression 
control sequences, include, for example, the early and late promoters of SV40, tet 
promoter, adenovirus or cytomegalovirus immediate early promoter, the lac system, 
the trp system, the TAC or TRC system, T7 promoter whose expression is directed 
20 by T7 RNA polymerase, the major operator and promoter regions of phage lambda , 
the control regions for fd coat protein, the promoter for 3-phosphoglycerate kinase 
or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the 
promoters of the yeast a-mating factors, the polyhedron promoter of the baculo virus 
system and other sequences known to control the expression of genes of prokaryotic 
25 or eukaryotic cells or their viruses, and various combinations thereof. It should be 
understood that the design of the expression vector may depend on such factors as 
the choice of the host cell to be transformed and/or the type of protein desired to be 
expressed. Moreover, the vector's copy number, the ability to control that copy 
number and the expression of any other protein encoded by the vector, such as 
30 antibiotic markers, should also be considered. 

As will be apparent, the subject gene constructs can be used to cause 
expression of the POSH or POSH-AP polypeptides in cells propagated in culture, 
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e.g., to produce proteins or polypeptides, including fusion proteins or polypeptides, 
for purification. 

This application also pertains to a host cell transfected with a recombinant 
gene including a coding sequence for one or more of the POSH or POSH-AP 
5 polypeptides. The host cell may be any prokaryotic or eukaryotic cell. For 
example, a polypeptide of the present application may be expressed in bacterial cells 
such as E. coli, insect cells (e.g., using a baculovirus expression system), yeast, or 
mammalian cells. Other suitable host cells are known to those skilled in the art. 
Accordingly, t he p resent application further p ertains torn ethods o f p roducing t he 

10 POSH or POSH-AP polypeptides. For example, a host cell transfected with an 
expression vector encoding a POSH polypeptide can be cultured under appropriate 
conditions to allow expression of the polypeptide to occur. The polypeptide may be 
secreted and isolated from a mixture of cells and medium containing the 
polypeptide. Alternatively, the polypeptide may be retained cytoplasmically and the 

15 cells harvested, lysedand the protein isolated. A cell culture includes host cells, 
media and other byproducts. Suitable media for cell culture are well known in the 
art. The polypeptide can be isolated from cell culture medium, host cells, or both 
using techniques known in the art for purifying proteins, including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 

20 immunoafBnity p urification w ith antibodies s pecific f or p articular epitopes o f t he 
polypeptide. In a preferred embodiment, the POSH or POSH-AP polypeptide is a 
fusion protein containing a domain which facilitates its purification, such as a 
POSH-GST fusion protein, POSH-intein fusion protein, POSH-cellulose binding 
domain fusion protein, POSH-polyhistidine fusion protein etc. 

25 A recombinant POSH or POSH-AP nucleic acid can be produced by ligating 

the cloned gene, or a portion thereof, into a vector suitable for expression in either 
prokaryotic cells, eukaryotic cells, or both. Expression vehicles for production of a 
recombinant POSH or POSH-AP polypeptides include plasmids and other vectors. 
For instance, suitable vectors for the expression of a POSH polypeptide include 

30 plasmids of the types: pBR322-derived plasmids, pEMBL-derived plasmids, pEX- 
derived plasmids, pBTac-derived plasmids and pUC-derived plasmids for 
expression in prokaryotic cells, such as E. coli. 
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The preferred mammalian expression vectors contain both prokaryotic 
sequences to facilitate the propagation of the vector in bacteria, and one or more 
eukaryotic transcription units that are expressed in eukaryotic cells. The 
pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, 

5 pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived vectors are examples of 
mammalian expression vectors suitable for transfection of eukaryotic cells. Some of 
these vectors are modified with sequences from bacterial plasmids, such as pBR322, 
to facilitate replication and drug resistance selection in both prokaryotic and 
eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papilloma 

10 virus (BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used 
for transient expression of proteins in eukaryotic cells. Examples of other viral 
(including retroviral) expression systems can be found below in the description of 
gene therapy delivery systems. The various methods employed in the preparation of 
the plasmids and transformation of host organisms are well known in the art. For 

15 other suitable expression systems for both prokaryotic and eukaryotic cells, as well 
as general recombinant procedures, see Molecular Cloning A Laboratory Manual, 
2nd E d., e d. b y S ambrook, F ritsch and M aniatis ( Cold S pring H arbor Laboratory 
Press, 1989) Chapters 16 and 17. In some instances, it may be desirable to express 
the recombinant POSH or POSH-AP polypeptide by the use of a baculovirus 

20 expression system. Examples of such baculovirus expression systems include pVL- 
derived vectors (such as pVL1392, pVL1393 and pVL941), pAcUW-derived vectors 
(such as pAcUWl), and pBlueBac-derived vectors (such as the B-gal containing 
pBlueBac HI). 

Alternatively, the coding sequences for the polypeptide can be incorporated 
25 as a part of a fusion gene including a nucleotide sequence encoding a different 
polypeptide. This type of expression system can be useful under conditions where it 
is d esirable, e .g., t o p roduce an i mmunogenic fragment o f a P OSH o r P OSH-AP 
polypeptide. For example, the VP6 capsid protein of rotavirus can be used as an 
immunologic carrier protein for portions of polypeptide, either in the monomeric 
30 form or in the form of a viral particle. The nucleic acid sequences corresponding to 
the p ortion o f t he P OSH o r P OSH-AP p olypeptide tow hich antibodies a re t o b e 
raised can be incorporated into a fusion gene construct which includes coding 
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sequences for a late vaccinia virus structural protein to produce a set of recombinant 
viruses expressing fusion proteins comprising a portion of the protein as part of the 
virion. The Hepatitis B surface antigen can also be utilized in this role as well. 
Similarly, chimeric constructs coding for fusion proteins containing a portion of a 
5 POSH polypeptide and the poliovirus capsid protein can be created to. enhance 
immunogenicity (see, for example, EP Publication NO: 0259149; and Evans et al.„ 
(1989) Nature 339:385; Huang et al., (1988) J. Virol 62:3855; and Schlienger et al., 
(1992) 7. Virol 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can be 

10 utilized, wherein a desired portion of a POSH or POSH-AP polypeptide is obtained 
directly from organo-chemical synthesis of the peptide onto an oligomeric branching 
lysine core (see, for example, Posnett et al., (1988) JBC 263:1719 and Nardelli et 
al., (1992) J. Immunol 148:914). Antigenic determinants of a POSH or POSH-AP 
polypeptide can also be expressed and presented by bacterial cells. 

15 In another embodiment, a fusion gene coding for a purification leader 

sequence, such as a poly-(His)/enterokinase cleavage site sequence at the N- 
terminus of the desired portion of the recombinant protein, can allow purification of 
the expressed fusion protein by affinity chromatography using a Ni 24 " metal resin. 
The purification leader sequence can then be subsequently removed by treatment 

20 with enterokinase to provide the purified POSH or POSH-AP polypeptide (e.g., see 
Hochuli et al., (1987) J. Chromatography 411:177; and Janknecht et al., PNAS USA 
88:8972). 

Techniques for making fusion genes are well known. Essentially, the joining 
of various DNA fragments coding for different polypeptide sequences is performed 

25 in accordance with conventional techniques, employing blunt-ended or stagger- 
ended termini for ligation, restriction enzyme digestion to provide for appropriate 
termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the 
fusion gene can be synthesized by conventional techniques including automated 

30 DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers which give rise to complementary overhangs 
between two consecutive gene fragments which can subsequently be annealed to 
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generate a chimeric gene sequence (see, for example, Current Protocols in 
Molecular Biology, eds. Ausubel et al., John Wiley & Sons: 1992). 



Table 2: Exemplary POSH nucleic acids 



Sequence Name 


Organism 


Accession Number 








cDNA FIJI 1367 fis, clone 
HEMBA1 000303 


Homo sapiens 


AK021429 


Plenty of SH3 domains 
(POSH) mRNA 


Mus museums 


NM_021506 


Plenty of SH3s (POSH) 
mRNA 


Mus musculus 


AF030131 


Plenty of SH3s (POSH) 
mRNA 


Drosophila melanogaster 


NM_079052 


Plenty of SH3s (POSH) 
mRNA 


Drosophila melanogaster 


AF220364 


Table 3: Exemplary POSH polypeptides 


Sequence Name 


Organism 


Accession Number 








SH3 domains- 
containing protein POSH 


Mus musculus 


T09071 


plenty of SH3 domains 


Mus musculus 


NP_067481 


Plenty of SH3s; POSH 


Mus musculus 


AAC40070 


Plenty of SH3s 


Drosophila melanogaster 


AAF37265 


LD45365p 


Drosophila melanogaster 


AAK93408 


POSH gene product 


Drosophila melanogaster 


AAF57833 
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Plenty of SH3s 



Drosophila melanogaster 



NP 523776 



In addition the following Tables provide the nucleic acid sequence and 
related SEQ ID NOs for domains of human POSH protein and a summary of POSH 
sequence identification numbers used in this application. 



Table 4. Nucleic Acid Sequences and related SEQ ID NOs for domains in human 
POSH 



Name of the 
sequence 


Sequence 


SEQ ID 
NO. 


RING domain 
i 


TGTCCGGTGTGTCTAGAGCGCCTTGATGCTTCTGCGAAGGTCT 
TGCCTTGCCAGCATACGTTTTGCAAGCGATGTTTGCT 

GGGGATCGTAGGTTCTCGAAATGAACTCAGATGTCCCGAGT 


31 


1 st SH 3 
domain 


CCATGTGCCAAAGCGTTATACAACTATGAAGGAAAAGAGCCTG 
GAGACCTTAAATTCAGCAAAGGCGACATCATCATTTT 

GCGAAGACAAGTGGATGAAAATTGGTACCATGGGGAAGTCAAT 
GGAATCCATGGCTTTTTCCCCACCAACTTTGTGCAGA 

TTATT 


32 


2 na SH 3 
domain 


CCTCAGTGCAAAGCACTTTATGACTTTGAAGTGAAAGACAAGG 
AAGCAGACAAAGATTGCCTTCCATTTGCAAAGGATGA 

TGTTCTGACTGTGATCCGAAGAGTGGATGAAAACTGGGCTG7VA 
GGAATGCTGGCAGACAAAATAGGAATATTTCCAATTT 

CATATGTTGAGTTTAAC 


33 


3 rd SH 3 
domain 


AGTGTGTATGTTGCTATATATCCATACACTCCTCGGAAAGAGG 
ATGAACTAGAGCTGAGAAAAGGGGAGATGTTTTTAGT 

GTTTGAGCGCTGCCAGGATGGCTGGTTCAAAGGGACATCCATG 
CATACCAGCAAGATAGGGGTTTTCCCTGGCAATTATG 

TGGCACCAGTC 


34 
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4 ch SH 3 


GAAAGGCACAGGGTGGTGGTTTCCTATCCTCCTCAGAGTGAGG 


35 


domain 


CAGAACTTGAACTTAAAGAAGGAGATATTGTGTTTGT 






TCATAAAAAACGAGAGGATGGCTGGTTCAAAGGCACATTACAA 






CGTAATGGGAAAACTGGCCTTTTCCCAGGAAGCTTTG 






TGGAAAACA 





Table 5. Summary of POSH sequence Identification Numbers 



S^crupncre Inf onnat ion 


Sequence Identification Number 
(SEQ ID NO) 


Human POSH foH i Tier ^erm^tic^ 


SEO ID No- 1 


Human POSH Amino Acid Sequence 


SEQ ID No: 2 


rlU.ll let I J. trKJOn \-,LJL\r\ DctJ L U.Cllv_C 


OHiV 1U JWU • -J 


5' cDNA Fragment of Human POSH 


SEQ ID No: 4 


I>J — Lcl IIl_L.HIJ.fc> riOLclu r i aylilcilL UJ. 

Human POSH 


qpn TD KFo • ^ 

OCi^ IN (J • ZJ 


J UlrCIVJA C X ciyUlfcillU ilVAlUdil irVJoXl 


OJCjV^ XL* X\JU . O 


C- terminus Protein Fragment of 

Unman DHCU 


SEQ ID No: 7 


Mouse POSH mRNA Sequence 


SEQ ID No; 8 


Mouse trvjori frotein oequence 




Drosophila melanogaster POSH 


SEQ ID No: 10 


Drosophila melanogaster POSH 
Protein Sequence 


SEQ ID No: 11 


Human POSH RING Domain Amino 
Acid Sequence 


SEQ ID No: 26 


Human POSH 1 st SH 3 Domain Amino 
Acid Sequence 


SEQ ID NO: 27 


Human POSH 2 nd SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 28 


Human POSH 3 rd SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 29 


Human POSH 4 th SH 3 Domain Amino 
Acid Sequence 


SEQ ID No: 30 


Human POSH RING Domain Nucleic 
Acid Sequence 


SEQ ID NO: 31. 
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POSH loss of function phenotype is a decrease in virus-like particle production in a 
cell transfected with a viral vector, optionally an HIV vector. In certain 
embodiments, a POSH polypeptide, when produced at an effective level in a cell, 
induces apoptosis. 

5 In another aspect, the application provides polypeptides that are agonists or 

antagonists of a POSH or POSH-AP polypeptide. Variants and fragments of a 
POSH or POSH-AP polypeptide may have a hyperactive or constitutive activity, or, 
alternatively, act to prevent POSH or POSH-AP polypeptides from performing one 
or more functions. For example, a truncated form lacking one or more domain may 
1 0 have a dominant negative effect. 

Another aspect of the application relates to polypeptides derived from a full- 
length P OSH or P OSH-AP p olypeptide. Isolated p eptidyl p ortions o f the s ubject 
proteins c an b e o btained b y screening p olypeptides r ecombinantly produced from 
the corresponding fragment of the nucleic acid encoding such polypeptides. In 
15 addition, fragments can be chemically synthesized using techniques known in the art 
such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, 
any one of the subject proteins can be arbitrarily divided into fragments of desired 
length with no overlap of the fragments, or preferably divided into overlapping 
fragments of a desired length. The fragments can be produced (recombinantly or by 
20 chemical synthesis) and tested to identify those peptidyl fragments which can 
function as either agonists or antagonists of the formation of a specific protein 
complex, or more generally of a POSH:POSH-AP complex, such as by 
microinjection assays. 

It is also possible to modify the structure of the POSH or POSH-AP 
25 polypeptides for such purposes as enhancing therapeutic or prophylactic efficacy, or 
stability (e.g., ex vivo shelf life and resistance to proteolytic degradation in vivo). 
Such modified polypeptides, when designed to retain at least one activity of the 
naturally-occurring form of the protein, are considered functional equivalents of the 
POSH or POSH-AP polypeptides described in more detail herein. Such modified 
30 polypeptides can be produced, for instance, by amino acid substitution, deletion, or 
addition. 
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For instance, it is reasonable to expect, for example, that an isolated 
replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, 
a threonine with a serine, or a similar replacement of an amino acid with a 
structurally related amino acid (i.e,. conservative mutations) will not have a major 
5 effect on the biological activity of the resulting molecule. Conservative replacements 
are those that take place within a family of amino acids that are related in their side 
chains. Genetically encoded amino acids are can be divided into four families (see, 
for example, Biochemistry, 2nd ed., Ed. by L. Stryer, W.H. Freeman and Co., 1981). 
Whether a change in the amino acid sequence of a polypeptide results in a functional 

10 homolog can be readily determined by assessing the ability of the variant 
polypeptide to produce a response in cells in a fashion similar to the wild-type 
protein. For instance, such variant forms of a POSH polypeptide can be assessed, 
e.g., for their ability to bind to another polypeptide, e.g., another POSH polypeptide 
or another protein involved in viral maturation. Polypeptides in which more than 

15 one replacement has taken place can readily be tested in the same manner. 

This application further contemplates a method of generating sets of 
combinatorial mutants of the POSH or POSH-AP polypeptides, as well as truncation 
mutants, and is especially useful for identifying potential variant sequences (e.g., 
homologs) that are functional in binding to a POSH or POSH-AP polypeptide. The 

20 purpose of screening such combinatorial libraries is to generate, for example, POSH 
homologs which can act as either agonists or antagonist, or alternatively, which 
possess novel activities all together. Combinatorially-derived homologs can be 
generated which have a selective potency relative to a naturally occurring POSH or 
POSH-AP polypeptide. Such proteins, when expressed from recombinant DNA 

25 constructs, can be used in gene therapy protocols. 

Likewise, mutagenesis can give rise to homologs which have intracellular 
half-lives dramatically different than the corresponding wild-type protein. For 
example, the altered protein can be rendered either more stable or less stable to 
proteolytic degradation or other cellular process which result in destruction of, or 

30 otherwise inactivation of the POSH or POSH-AP polypeptide of interest. Such 
homologs, and the genes which encode them, can be utilized to alter POSH or 
POSH-AP levels by modulating the half-life of the protein. For instance, a short 
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half-life can give rise to more transient biological effects and, when part of an 
inducible expression system, can allow tighter control of recombinant POSH or 
POSH-AP levels within the cell. As above, such proteins, and particularly their 
recombinant nucleic acid constructs, can be used in gene therapy protocols. 
5 In similar fashion, POSH or POSH-AP homologs can be generated by the 

present combinatorial approach to act as antagonists, in that they are able to interfere 
with the ability of the corresponding wild-type protein to function. 

In a representative embodiment of this method, the amino acid sequences for 
a population of POSH or POSH-AP homologs are aligned, preferably to promote the 

10 highest homology possible. Such a population of variants can include, for example, 
homologs from one or more species, or homologs from the same species, but which 
differ due to mutation. Amino acids which appear at each position of the aligned 
sequences are selected to create a degenerate set of combinatorial sequences. In a 
preferred embodiment, the combinatorial library is produced by way of a degenerate 

15 library of genes encoding a library of polypeptides which each include at least a 
portion of potential POSH or POSH-AP sequences. For instance, a mixture of 
synthetic oligonucleotides can be enzymatically ligated into gene sequences such 
that the degenerate set of potential POSH or POSH-AP nucleotide sequences are 
expressible as individual polypeptides, or alternatively, as a set of larger fusion 

20 proteins (e.g., for phage display). 

There are many ways by which the library of potential homologs can be 
generated from a degenerate oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be carried out in an automatic DNA synthesizer, and 
the synthetic genes then be ligated into an appropriate gene for expression. The 

25 purpose of a degenerate set of genes is to provide, in one mixture, all of the 
sequences encoding the desired set of potential POSH or POSH-AP sequences. The 
synthesis of degenerate oligonucleotides is well known in the art (see for example, 
Narang, SA (1983) Tetrahedron 39:3; Itakura et al., (1981) Recombinant DNA, 
Proc. 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: 

30 Elsevier pp273-289; Itakura et al., (1984) Annu. Rev. Biochem. 53:323; Itakura et 
al., (1984) Science 198:1056; Dee et al., (1983) Nucleic Acid Res. 11:477). Such 
techniaues have been employed in the directed evolution of other proteins (see, for 
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example, Scott et al., (1990) Science 249:386-390; Roberts et al., (1992) PNAS 
USA 89:2429-2433; Devlin et al., (1990) Science 249: 404-406; Cwirla et al., 
(1990) PNAS USA 87: 6378-6382; as well as U.S. Patent Nos: 5,223,409, 
5,198,346, and 5,096,815). 
5 Alternatively, other forms of mutagenesis can be utilized to generate a 

combinatorial 1 ibrary . F or e xample, P OSH o r P OSH- AP h omologs ( both a gonist 
and antagonist forms) can be generated and isolated from a library by screening 
using, for example, alanine scanning mutagenesis and the like (Ruf et al., ( 1994) 
Biochemistry 33:1565-1572; Wang et al., (1994) J. Biol. Chem. 269:3095-3099; 

10 Balint et al., ( 1993) Gene 1 37:109-118; Grodberg et al., ( 1993) Eur. J. Biochem. 
218:597-601; Nagashima et al., (1993) J. Biol. Chem. 268:2888-2892; Lowman et 
al., (1991) Biochemistry 30:10832-10838; and Cunningham et al., (1989) Science 
244:1081-1085), by linker scanning mutagenesis (Gustin et al., (1993) Virology 
193:653-660; Brown et al., (1992) Mol. Cell Biol. 12:2644-2652; McKnight et al., 

15 (1982) Science 232:316); by saturation mutagenesis (Meyers et al., (1986) Science 
232:613); by PCR mutagenesis (Leung et al., (1989) Method Cell Mol Biol 1:11- 
19); or by random mutagenesis, including chemical mutagenesis, etc. (Miller et al., 
(1992) A Short Course in Bacterial Genetics, CSHL Press, Cold Spring Harbor, NY; 
and Greener et al., (1994) Strategies in Mol Biol 7:32-34). Linker scanning 

20 mutagenesis, particularly in a combinatorial setting, is an attractive method for 
identifying truncated (bioactive) forms of POSH or POSH-AP polypeptides. 

A wide range of techniques are known in the art for screening gene products 
of combinatorial libraries made by point mutations and truncations, and, for that 
matter, for screening cDNA libraries for gene products having a certain property. 

25 Such techniques will be generally adaptable for rapid screening of the gene libraries 
generated by the combinatorial mutagenesis of POSH or POSH-AP homologs. The 
most widely used techniques for screening large gene libraries typically comprises 
cloning the gene library into replicable expression vectors, transforming appropriate 
cells with the resulting library of vectors, and expressing the combinatorial genes 

30 under conditions in which detection of a desired activity facilitates relatively easy 
isolation of the vector encoding the gene whose product was detected. Each of the 
illustrative assays described below are amenable to high through-put analysis as 
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necessary to screen large numbers of degenerate sequences created by combinatorial 
mutagenesis techniques. 

In an illustrative embodiment of a screening assay, candidate combinatorial 
gene products of one of the subject proteins are displayed on the surface of a cell or 
5 virus, and the ability of particular cells or viral particles to bind a POSH or POSH- 
AP polypeptide is detected in a "panning assay". For instance, a library of POSH 
variants can be cloned into the gene for a surface membrane protein of a bacterial 
cell (Ladner et al.„ WO 88/06630; Fuchs. et al., (1991) Bio/Technology 9:1370- 
1371; and Goward et al., (1992) TIBS 18:136-140), and the resulting fusion protein 
10 detected by panning, e.g., using a fluorescently labeled molecule which binds the 
POSH polypeptide, to score for potentially functional homologs. Cells can be 
visually inspected and separated under a fluorescence microscope, or, where the 
morphology of the cell permits, separated by a fluorescence-activated cell sorter. 

In similar fashion, the gene library can be expressed as a fusion protein on 
1 5 the surface of a viral particle. For instance, in the filamentous phage system, foreign 
peptide sequences can be expressed on the surface of infectious phage, thereby 
conferring two significant benefits. First, since these phage can be applied to 
affinity matrices at very high concentrations, a large number of phage can be 
screened at one time. Second, since each infectious phage displays the 
20 combinatorial gene product on its surface, if a particular phage is recovered from an 
affinity matrix in low yield, the phage can be amplified by another round of 
infection. The group of almost identical E. coli filamentous phages Ml 3, fd, and fl 
are most often used in phage display libraries, as either of the phage gin or gVEI 
coat proteins can be used to generate fusion proteins without disrupting the ultimate 
25 packaging of the viral particle (Ladner et al., PCT publication WO 90/02909; 
Garrard et al., PCT publication WO 92/09690; Marks et al., (1992) J. Biol. Chem. 
267:16007-16010; Griffiths et al., (1993) EMBO J. 12:725-734; Clackson et al., 
(1991) Nature 352:624-628; and Barbas et al., (1992) PNAS USA 89:4457-4461). 

The application also provides for reduction of the POSH or POSH-AP 
30 polypeptides to generate mimetics, e.g., peptide or non-peptide agents, which are 
able to mimic binding of the authentic protein to another cellular partner. Such 
mutaeenic techniques as described above, as well as the thioredoxin system, are also 
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particularly useful for mapping the determinants of a POSH or POSH-AP 
polypeptide which participate in protein-protein interactions involved in, for 
example, binding of proteins involved in viral maturation to each other. To 
illustrate, the critical residues of a POSH or POSH-AP polypeptide which are 
5 involved in molecular recognition of a substrate protein can be determined and used 
to generate its derivative peptidomimetics which bind to the substrate protein, and 
by inhibiting POSH or POSH-AP binding, act to inhibit its biological activity. By 
employing, for example, scanning mutagenesis to map the amino acid residues of a 
POSH polypeptide which are involved in binding to another polypeptide, 
10 peptidomimetic compounds can be generated which mimic those residues involved 
in binding. For instance, non-hydrolyzable peptide analogs of such residues can be 
generated using benzodiazepine (e.g., see Freidinger et al., in Peptides: Chemistry 
and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), 
azepine (e.g., see Huffinan et al., in Peptides: Chemistry and Biology, G.R. Marshall 
15 ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gamma lactam rings 
(Garvey et al., in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 
Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et 
al., (1986) J. Med. Chem. 29:295; and Ewenson et al., in Peptides: Structure and 
Function (Proceedings of the 9th American Peptide Symposium) Pierce C hemical 
20 Co. Rockland, IL, 1985), b-turn dipeptide cores (Nagai et al., (1985) Tetrahedron 
Lett 26:647; and Sato et al., (1986) J Chem Soc Perkin Trans 1:1231), and b- 
aminoalcohols (Gordon et al., (1985) Biochem Biophys Res Commun 126:419; and 
Dann et al., (1986) Biochem Biophys Res Commun 134:71). 

The following table provides the sequences of the RING domain and the 
25 various SH3 domains of POSH. 



Table 6. Amino Acid Sequences and related SEQ ID NOs for domains in human 
POSH 



Name of 
the 

sequence 


Sequence 


SEQ ID 
NO. 


RING 


CPVCLERIjDASAKVLPCQHTFCKRCIjLGIVGSRNELRCPEC 


26 
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domain 






l 8t SH 3 
domain 


PCAKALYNYEGKEPGDLKFSKGDI 1 1 LRRQVDENWYHGEVNGIHGF 
FPTNFVQIIK 


27 


2 nd SH 3 
domain 


PQCKALYDFEVKDKEADKDCLPFAKDDVLTVIRRVM 
KIGIFPISYVEFNS 


28 


3 rd SH 3 
domain 


SVYVAIYPYTPRKEDELEBRKGEMFIiVFERCQDGWFKGTSMHTSKI 
GVFPGNYVAPVT 


29 


4** SH 3 
domain 


ERHRVWSYPPQSEAELELKEGDIVFVHKKREDGWFKGTLQRNGKT 
GLFPGSFVENI 


30 



The following table provides a list of selected POSH-APs and their related 
SEQ ID NOs. 



5 Table 7 - Selected POSH APs 



Protein 


Protein Sequence 
(SEQ ID NO:) 


mRNA Sequence 
(SEQDD NO:) 


ARF1 


223 


325-339 


ARF5 


224 


340-344 


ATP6V0C 


225-226 


345-351 


CBL-B 


361; 398; 227-230 


353-360 


CENTB1 


231-232 


37-47 


DDEF1 


233-237 


48-54 


EEF3S3 


238 


55-57 


EPS8L2 


239 


58-60 


GO C API 


240-243 


61-68 


GOSR2 


244-248 


69-76 


HERPUD1 


249-252 


77-86 


HLA-A 


253 


87-88 


HLA-B 


254 


89 


MSTP028 


255-256 


90-94 


PACS-1 


362-366 


95-100 


PPP1CA 


261-263; 395 


101-110 


PRKAR1A 


264-265 


111-122; 396-397 


PTPN12 


266-268 


123-129 


RALA 


269-270 


130-134 


SIAH1 


271-272 


135-141 


SMN1 


273-275 


142-146 


SMN2 


276-280 


147-151 


SNX1 


281-286 


152-161 


SNX3 


287-290 


162-174 
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Protein 


Protein Sequence 


mRNA Sequence 




(SEQ ID NO:) 


(SEQ ID NO:) 


SRA1 


291-294 


175-182 


SYNE1 


295-307 


183-201 


TTC3 


308-312 


202-207 


UBE2N 


313 


208-210 


UNC84B 


314 


211-213 j 


VCY2IP1 


315-323 


214-222 


SPG20 


386-388 


367-374 


WASF1 


389 


375-376 


HIP55 


390-394 


377-385 



Table 8 below provides a list of POSH-APs that bound POSH in a 2-hybrid 
5 assay. Nucleic acid and amino acid sequences of the POSH-APs listed in Table 8 
were filed in a U.S. provisional application filed in the name of Daniel N. Taglicht, 
Ms Alroy, Yuval Reiss, Liora Yaar, Danny Ben-Avraham, Shmuel Tuvia, and 
Tsvika Greener entitled "Posh Interacting Proteins and Related Methods", filed on 
March 2, 2004 (Attorney Docket No. PROL-P79-024), which Provisional 
1 0 Application is incorporated herein by reference in its entirety. 



Table 8 -POSH-APs 



Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) 


(public gi No.) 


BCL9 - var 1 


4757846 


4757845 


BRD4 - var 1 


19718731 


19718730 


BRD4 — var 2 


7657218 


7657217 


DRP2 - var 1 


4503393 


4503392 


MAPlA-var 1 


21536458 


21536457 


SH2D2A-var 1 


4503633 


31543620 


B AT3 — var 1 


18375630 


18375633 


BAT3 - var 2 


18375634 


18375631 


BAT3 - var 3 


* 


18375629 


BCARl-varl 


7656924 


7656923 


DAP - var 1 


4758120 


4758119 


EVPL-var 1 


4503613 


4503612 


FIJI 3231 -var 1 


38604073 


38604072 


FL53657-var 1 


13376230 


13376229 


HSPC142-var 1 


7661802 


7661801 


LOCI 18987- var 1 


29789403 


31341089 


NAP4 - var 1 


2443367 


2443366 
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Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) 


(public gi No.) 


RBAF600 — var 1 


24416002 


24416001 


XTP3TPB - var 1 


20070264 


20070263 


Hs.31535-var 1 


37546355 


37546354 ' 


ASF1B - var 1 


8922549 


8922548 


ATP5A1 - var 1 


4757810 


23346425 


C6 or fl 1- var 1 


9954875 


39725662 


C6orf60-var 1 


24431997 


24431996 


CDTl-varl 


16418337 


19923847 


CIC-var 1 ! 


16507208 


16507207 


CLK2 - var 1 


4557477 


4557476 


CLK2 - var 2 


4502883 


4502882 


DNM2 - var 1 


4826700 


4826699 


EEF1A1 — var 1 


4503471 


25453469 


EIF4EBP1 - var 1 


4758258 


20070179 


FLJ13479-var 1 


24432013 


39725704 


GC20 - var 1 


5031711 


5031710 


GLUL - var 1 


19923206 


21361767 


HEBP2 - var 1 


7657603 


7657602 


ITGB- var 1 


4504779 


4504778 


LAMAS — var 1 


21264602 


21264601 


LOC90987-var 1 


29734345 


29734344 ! 


MRPL36 - var 1 


23111040 


20806105 


Hs.380933-var 1 


30149441 


37550602 


NQ02-var 1 


4505417 


4505416 


PCBPl-varl 


5453854 


14141164 


PCNT2 - var 1 


22035674 


35493922 


PGD - var 1 


984325 


984324 


RAP80 - var 1 


21361593 


21361592 


RNH - var 1 


21361547 


21361546 


RPL-var 1 


4506597 


15431291 


RPS20 - var 1 


4506697 


14591915 


RPS27A - var 1 


4506713 


27436941 


SETDB1 - var 1 


6912652 


6912651 


SF3A2 - var 1 


21361376 


32189413 


UBB - var 1 


11024714 


22538474 


ARHV-var 1 


20070360 


20070359 


KIAAllll-varl 


32698700 


32698699 


ZNF147-var 1 


4827065 


15208652 


PAWR-var 1 


4505613 


4505612 


TPX2 - var 1 


20127519 


31542258 


HSPAlB-var 1 


4885431 


26787974 


DLG5 - var 1 


3043690 


3650451 


DLG5 - var 2 


28466997 


28466996 


1 DLG5 - var 3 


3650452 


16549841 
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Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) 


(public gi No.) 


DLG5 - var 4 




16807129 


DLG5 - var 5 


* 


22539637 


DLG5 - var 6 


* 


15929207 


DLG5 - var 7 




3043689 


KIAA1598- var 1 


7023592 


7023591 


KIAA1598- var 2 


10047271 


7018519 


KIAA1598- var 3 




21314680 


KIAA1598- var 4 


* 


10047270 


KIAA1598 - var 5 


* 


21755030 


KIAA1598- var 6 ! 


* 


21755023 


KIAA1 598 - var 7 


* 


21754670 


KIAA1598- var 8 


* 


21750902 


KIAA1598- var 9 




21749984 


KIAA1598- var 10 




21749775 


, KIAA1598- var 11 




21749737 


CGI-27 - var 1 


7705720 


23270696 


CGI-27 — var 2 


* 


22902234 


CGI-27 - var 3 


* 


17046302 


CGI-27 - var 4 


* 


16553689 


CGI-27 - var 5 


* 


10433504 


CGI-27 - var 6 


* 


4680692 


CGI-27 - var 7 


* 


20127543 


BIA2 - var 1 

JJjJXl » V CXI X 


5262640 


5262639 


BIA2 — var 2 


21591225 


21591224 


BIA2 - var 3 


* 


21755615 


COLIA1 - var 1 


180392 


407589 


COLIA1 - var 2 


180857 


30015 


COLIA1 - var 3 


1418928 


30092 


COLIAl - var 4 


22328092 


7209641 


COLIAl - var 5 


762938 


22328091 


COLIAl - var 6 


30016 


1418927 


COLIAl - var 7 


407590 


180856 


COLIAl - var 8 

V> X^l..« XX » w* w 




180391 


COLIAl - var 9 


* 


14719826 


DKFZp761 A052 - var 1 


10434104 


10434103 


DKFZt/761 A052 - var 2 


10439058 


10439057 


DKFZp761 A052 - var 3 


14602829 


14602828 


DKFZp761 A052 - var 4 


20380411 


15079884 


DKFZp761 A052 - var 5 


6808165 


20380410 


DKFZp761 A052 - var 6 


* 


6808164 


TLEl-var 1 


14603281 


16041735 


TLE1 — var 2 


307510 


14603280 


TLE1 - var 3 




307509 


EGLN2 - var 1 


8922130 


23273571 
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Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) 


(public gi No.) 


EGLN2 — var 2 


12804603 


10437903 


EGLN2 - var 3 


14547148 


21733075 


EGLN2 - var 4 


18031805 


21758140 


EGLN2 — var 5 


* 


18677002 


EGLN2 - var 6 


* 


18031804 


EGLN2 - var 7 


* 


18141576 


EGLN2 — var 8 


* 


14547147 


EGLN2 - var 9 


* 


12804602 


EGLN2-var 10 


* 


10439822 


EGLN2 - var 1 1 


* 


8922129 


STC2 - var 1 


3335144 


3335143 


STC2 - var 2 


* 


3702223 


STC2 - var 3 


* 


4050037 ! 


STC2 - var 4 


* 


4104014 


STC2 - var 5 


* 1 


13623494 


STC2 - var 6 


*. 


14042507 


STC2-var7 


* 


14042032 


STC2 - var 8 


* 


21755241 


STC2 - var 9 


* 


21755207 


STC2-var 10 


* 


22761473 


STC2 - var 1 1 


* 


12653744 


OPTN - var 1 


20149572 


16550123 


OPTN - var 2 


21619683 


3387890 


OPTN — var 3 


3329431 


3127082 


OPTN — var 4 


3127083 


3329430 


OPTN — var 5 


* 


21619682 


OPTN — var 6 


* 


18644681 


OPTN - var 7 


* 


18644683 


OPTN -var 8 


* 


18644685 


OPTN- var 9 


* 


20149571 


FU37147-var 1 


21753535 


21753534 


FU37147-var2 


30153743 


30153742 


KHDRBS1 - var 1 


21749696 


189499 


KHDP3S1 - var 2 


1841747 


12653852 


KHDRBS1 — var 3 


189500 


17512262 


KHDRBS1 - var 4 


* 


14714433 


KHDRBS1 -var 5 


* 


1841746 


KHDRBS1 — var 6 


* 


21749695 


SLC2A1 - var 1 


3387905 


3387904 


SLC2Al-var2 


5730051 


5730050 


SLC2Al-var3 


14268550 


14268549 


DKFZp434B1231 - var 1 


6808117 


6808116 


NTJMA1 - var 1 


27694103 


5453819 


NUMA1 - var 2 


35119 


13278785 
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Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) 


(public gi No.) 


NUMA1 - var 3 


14249928 


14249927 


NUMA1 - var 4 


13278786 


15991876 


NUMA1 - var 5 


5453820 


296118 


NUMA1 - var 6 


* 


296119 


NUMA1 - var 7 


* 


296120 


NUMA1 - var 8 


* 


35118 


NUMA1 - var 9 


* 


20073234 


NUMA1 - var 10 


* 


22477305 


NUMA1 - var 11 


* 


22749583 


NUMA1 - var 12 


* 


27694102 


HSPC016-var 1 


6841310 


12654536 


HSPC016-var2 


12654537 


6841309 


HSPC016-var3 


* 


4679017 


HSPC016-var4 


* 


10834763 


UBC - var 1 


5912028 


3360475 


UBC - var 2 


340058 


2647407 


UBC- var 3 


340068 


24657521 


UBC - var 4 


14286308 


21751700 


UBC -var 5 


15928840 


21757163 


UBC - var 6 


16552475 


21758959 


UBC - var 7 


* 


16552474 


UBC - var 8 


* 


15928839 


UBC- var 9 


* 


14286307 


UBC -var 10 


* 


12653358 


UBC - var 1 1 


* 


10439801 


UBC -var 12 


* 


340067 


UBC -var 13 


* 


340057 


UBC -var 14 


* 


5912027 


ZFM1 - var 1 


785999 


785998 


PIASY-var 1 


14603164 


3643110 


PIASY - var 2 


5533373 


5533372 


PIASY-var 3 


24850133 


10433892 


PIASY -var 4 


3643111 


14603163 


PIASY -var 5 


* 


20987516 


. PIASY -var 6 


* 


14709019 


XM 208944 -var 1 


30153743 


30153742 


J03930 - var 1 


178442 


178441 


MT2A — var 1 


187528 


37120 


MT2 A - var 2 


37121 


263506 


MT2A - var 3 


* 


13937856 


MT2A — var 4 


* 


1495465 


MT2 A - var 5 


* 


187527 


EWSR1 - var 1 


7669490 


21734132 


EWSR1 - var 2 


12653511 


547565 
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Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) 


(public gi No.) 


EWSR1 - var 3 


15029675 


21756356 


EWSR1 - var 4 


16552153 


16551673 


EWSR1 - var 5 


16551674 


16552152 


EWSR1 - var 6 


31280 


15029674 


EWSR1 - var 7 


* 


13435962 


EWSR1 - var 8 


* 


12653510 


EWSR1 - var 9 


* 


10439073 


EWSR1 - var 10 


* 


7669489 


MADH6 - var 1 


2828712 


1654326 


MADH6 - var 2 


2736316 


20379504 


MADH6 — var 3 


1654327 


2736315 


MADH6 - var 4 


* 


2828711 


MADH6 - var 5 


* 


15278059 


THOC2 - var 1 


20799318 


10435649 


THOC2 - var 2 


10435650 


20799317 


THOC2 - var 3 


* 


7023224 


ZNF151 — var 1 


676873 


2230870 


ZNF151 - var 2 


2230871 


676872 


DDX31 - var 1 


10435700 


14042193 


DDX31 - var 2 


10440004 


15215272 


DDX31 - var 3 


20336298 


16566549 


DDX31 - var 4 


16566550 


20336297 


DDX31 - var 5 


15215273 


20336296 


DDX31 - var 6 


14042194 


10440003 


DDX31 - var 7 


* 


10435699 


POLR2J2 - var 1 


11595478 


21704271 


POLR2J2 - var 2 


21704274 


21704270 


POLR2J2 - var 3 


19401711 


19401710 


POLR2J2 - var 4 


14702175 


21704273 


POLR2J2 - var 5 


21704272 


16878085 


POLR2J2 - var 6 


* 


11595475 


POLR2J2 - var 7 


* 


11595477 


POLR2J2 - var 8 


* 


11595473 


BANF1 - var 1 


3002951 


11038645 


BANF1 - var 2 


4502389 


13543576 


BANF1 - var 3 


* 


14713907 


BANF1 - var 4 


* 


3002950 


BANF1 - var 5 


* 


4321975 


BANF1 - var 6 


* 


3220254 


CBX4-var 1 


1945453 


1945452 


CBX4-var2 


15929016 


2317722 


CBX4-var3 


2317723 


15929015 


ARIH2- var 1 


3925604 


3925603 


ARIH2 - var 2 


9963793 


3930777 
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Protein and Variant 


Protein Sequence 
(public gi No.) 


mRNA Sequence j 
(public gl INo.) I 


ARJH2- var 3 


12653307 


3986675 j 


ARM2-var4 


* 


3986676 


ARJH2-var 5 


* 


3986677 ) 


ARIH2 - var 6 


* 


7328049 1 


ARJH2-var7 


* 


6855602 


ARJH2-var8 


* 


21749565 1 


ARIH2-var9 


* 


33875424 j 


ARIH2 - var 1 0 


* 


9963792 


ARIH2 — var 1 1 


* 


5453556 J 


ARIH2- var 12 


* 


5817100 I 


ARIH2 - var 13 


* 


3930775 j 


SRPK2 - var 1 


1857944 


21752284 


SRPK2 - var 2 


23270876 


21749007 


SRPK2 - var 3 


* 


23270875 H 


SRPK2 - var 4 


* 


1857943 1 


SIAH2 - var 1 


2673968 


16549991 J 


SlAH2-var2 


2664283 


34189635 [ 


SIAH2-var3 


* 


2664282 | 


SIAH2 - var 4 




2673967 


KIAA0191-var 1 


27480017 


29387261 


KIAA0191 - var 2 


1228035 


10438300 J 


KIAA0191-var3 


29387262 


1228034 


KIAA0191 - var 4 


* 


21755057 ] 


KIAA0191 - var 5 


* 


27480016 


KIAA0191 - var 6 


* 


19387907 j 


KIAA0191 - var 7 


* 


15636651 


KIAA0191-var8 


* 


23273514 j 


PA1-RBP1 - var 1 


5262551 


22760761 } 


PA1-RBP1 - var 2 


4929579 


20072477 ~J 


PA1-RBP1 - var 3 


12804377 


•* /-^ /^V ^ f\_ At ^ A*" \ 

17939456 1 


PA1-RBP1 - var 4 


12803339 


18088243 


PA1-RBP1 - var 5 


14029171 


16924316 _j 


PA1-RBP1 - var 6 


18088244 


33872286 ! 


PA1-RBP1 - var 7 


22760762 


14029170 ] 


PA1-RBP1 - var 8 


* 


33876749 


PA1-RBP1 - var 9 


* 


12804376 


PA1-RBP1 - var 10 


* 


4929578 J 


PA1-RBP1 - var 11 




*rtUUUJ7 j 


PA1-RBP1 - var 12 




5262550 J 


FAT -var 1 


2281025 


1107686 J 


FAT - var 2 


1107687 


15214611 j 


FAT -var 3 


* 


2281024 


FAT - var 4 


* 


598748 ZJ 


VCL-var 1 


24657579 


7669551 J 
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Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) 


(public gi No.) 


VCL-var2 


340237 


7669549 


VCL-var3 


7669550 


340236 


VCL-var4 


* 


21732673 


VCL-var5 


* 


15426616 


VCL-var6 


* 


246657578 


SSR4 - var 1 


15929882 


30583222 


SSR4-var2 


13097213 


1071680 


SSR4-var3 


* 


22749791 


SSR4-var4 


* 


21753447 


SSR4-var5 


* 


16552704 


SSR4-var6 


* 


L 15929881 


SSR4-var7 


* 


13097212 


SSR4-var8 


* 


2398656 


PRDX5 - var 1 


6166493 


27484966 


PRDX5 - var 2 


6746355 


9802047 


PRDX5-var3 


9802048 


8745393 


PRDX5-var4 


27484967 


6746354 


PRDX5-var5 


* 


6563211 


PRDX5-var6 


* 


6103723 


PRDX5 - var 7 


* 


6166492 


PRDX5 - var 8 


* 


6523288 


PRDX5 - var 9 


* 


32455258 


FLJ10120-varl 


8922239 


27469671 


FLJ10120-var2 


* 


8922238 


PROL4 - var 1 


22208536 


22208535 


PROL4 - var 2 


6005802 


1050982 


CL25084-var 1 


15341891 


4406555 


CL25084-var2 


7023472 


. 4406692 


CL25084-var3 


4406693 


7023471 


CL25084-var4 


4406556 


15341890 


Cllorfl7-var 1 


22761313 


21361869 


Cllorfl7-var2 


21105773 


20149226 


Cllorfl7-var3 


20149225 


20149224 


Cllorfl7-var4 


20149227 


21105772 


Cllorfl7-var5 


21361870 


21410957 


Cllorfl7-var 6 


* 


22761312 


POLQ - var 1 


3510695 


13892060 


POLQ - var 2 


4163931 


13892060 


POLQ -var 3 


13892061 


4163930 


POLQ -var 4 


* 


3510694 


MBD2 - var 1 


3170202 


3800812 


MBD2 — var 2 


3800801 


5817231 


MBD2 - var 3 


7710145 


21595775 


MBD2 - var 4 


21595776 


21464120 
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Protein and Variant 


Protein Sequence 


mRNA Sequence 




(public gi No.) i 


(public gi No.) 


MBD2 - var 5 


* 


21464121 


MBD2 — var 6 


* 


3800800 


MBD2 - var 7 


* 


3800792 


MBD2 - var 8 


* 


3170201 


FSTLl-varl 


12658309 


536897 


FSTL1 - var 2 


12652619 


16924272 


FSTLl-var3 


* 


33990756 


FSTL1 - var 4 


* 


12658308 


FSTL1 - var 5 


* 


10438502 


FSTLl-var6 




4884472 



* denotes a polypeptide sequence that can be deduced from the corresponding 
mRNA sequence. 

5 

9. Effective Dose 

Toxicity and therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., 
for determining The LD50 (the dose lethal to 50% of the population) and the ED50 
10 (the dose therapeutically effective in 50% of the population). The dose ratio 
between toxic and therapeutic effects is the therapeutic index and it can be expressed 
as the ratio LD50/ED50. Compounds which exhibit large therapeutic induces are 
preferred. While compounds that exhibit toxic side effects may be used, care should 
be taken to design a delivery system that targets such compounds to the site of 
15 affected tissue in order to minimize potential damage to uninfected cells and, 
thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used 
in formulating a range of dosage for use in humans. The dosage of such compounds 
lies p referably w ithin a range o f c irculating c oncentrations t hat i nclude t he E D50 
20 with little or no toxicity. The dosage may vary within this range depending upon the 
dosage form employed and the route of administration utilized. For any compound 
used in the method of the application, the therapeutically effective dose can be 
estimated initially from cell culture assays. A dose may be formulated in animal 
models to achieve a circulating plasma concentration range that includes the IC50 
25 (i.e., the concentration of the test compound which achieves a half-maximal 
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inhibition of symptoms) as determined in cell culture. Such information can be used 
to more accurately determine useful doses in humans. Levels in plasma may be 
measured, for example, by high performance liquid chromatography. 

5 10. Formulation and Use 

Pharmaceutical compositions for use in accordance with the present 
application may be formulated in conventional maimer using one or more 
physiologically acceptable carriers or excipients. Thus, the compounds and their 
physiologically acceptable salts and solvates may be formulated for administration 
10 by, for example, injection, inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 

An exemplary composition of the application comprises an RNAi mixed 
with a delivery system, such as a liposome system, and optionally including an 
acceptable excipient. In a preferred embodiment, the composition is formulated for 
1 5 topical administration for, e.g., herpes virus infections. 

For such therapy, the compounds of the application can be formulated for a 
variety of loads of administration, including systemic and topical or localized 
administration. Techniques and formulations generally may be found in 
Remmington's P harmaceutical S ciences, M eade P ublishing C o., E aston, PA. F or 
20 systemic administration, injection is preferred, including intramuscular, intravenous, 
intraperitoneal, and subcutaneous. For injection, the compounds of the application 
can be formulated in liquid solutions, preferably in physiologically compatible 
buffers such as Hank's solution or Ringer's solution. In addition, the compounds 
may be formulated in solid form and redissolved or suspended immediately prior to 
25 use. Lyophilized forms are also included. 

For oral administration, the pharmaceutical compositions may take the form 
of, for example, tablets or capsules prepared by conventional means with 
pharmaceutically acceptable excipients such as binding agents (e.g., pregelatinised 
maize starch, polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., 
30 lactose, microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g., 
magnesium stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch 
glycolate); or wetting agents (e.g., sodium lauryl sulphate). The tablets may be 
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coated by methods well known in the art. Liquid preparations for oral 
administration may take the form of, for example, solutions, syrups or suspensions, 
or they may be presented as a dry product for constitution with water or other 
suitable vehicle before use. Such liquid preparations may be prepared by 
5 conventional means with pharmaceutically acceptable additives such as suspending 
agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., ationd oil, 
oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g., 
methyl o r p ropyl-p-hydroxybenzoates o r s orbic acid). T he p reparations m ay also 
10 contain buffer salts, flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give 
controlled release of the active compound. For buccal administration the 
compositions may take the form of tablets or lozenges formulated in conventional 
manner. For administration by inhalation, the compounds for use according to the 
15 present application are conveniently delivered in the form of an aerosol spray 
presentation from pressurized packs or a nebuliser, with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a 
pressurized aerosol the dosage unit may be determined by providing a valve to 
20 deliver a metered amount. Capsules and cartridges of e.g., gelatin for use in an 
inhaler or insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for injection 
25 may be presented in unit dosage form, e.g., in ampoules or in multi-dose containers, 
with an added preservative. The compositions may take such fonns as suspensions, 
solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, the 
active i ngredient m ay b e i n p owder f orm for c onstitution w ith a suitable v ehicle, 
30 e.g., sterile pyrogen-free water, before use. 
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The compounds may also be formulated in rectal compositions such as 
suppositories o r r etention e nemas, e .g., c ontaining c onventional s uppository b ases 
such as cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may 

5 also be formulated as a depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously or intramuscularly) or by 
intramuscular injection. Thus, for example, the compounds may be formulated with 
suitable polymeric or hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for 

1 0 example, as a sparingly soluble salt. 

Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration bile salts and 

15 fusidic acid derivatives, in addition, detergents maybe used to facilitate permeation. 
Transmucosal administration may be through nasal sprays or using suppositories. 
For topical administration, the oligomers of the application are formulated into 
ointments, salves, gels, or creams as generally known in the art. A wash solution 
can be used locally to treat an injury or inflammation to accelerate healing. 

20 The compositions may, if desired, be presented in a pack or dispenser device 

which may contain one or more unit dosage forms containing the active ingredient. 
The pack may for example comprise metal or plastic foil, such as a blister pack. The 
pack or dispenser device may be accompanied by instructions for administration. 

For therapies involving the administration of nucleic acids, the oligomers of 

25 the application can be formulated for a variety of modes of administration, including 
systemic and topical or localized administration. Techniques and formulations 
generally may be found in Remmington's Pharmaceutical Sciences, Meade 
Publishing Co., Easton, PA. For systemic administration, injection is preferred, 
including intramuscular, intravenous, intraperitoneal, intranodal, and subcutaneous 

30 for injection, the oligomers of the application can be formulated in liquid solutions, 
preferably in physiologically compatible buffers such as Hank's solution or Ringer's 
solution. In addition, the oligomers may be formulated in solid form and 
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redissolved or suspended immediately prior to use. Lyophilized forms are also 
included. 

Systemic administration can also be by transmucosal or transdermal means, 
or the compounds can be administered orally. For transmucosal or transdermal 
5 administration, penetrants appropriate to the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art, and include, for 
example, for transmucosal administration bile salts and fiisidic acid derivatives. In 
addition, detergents may be used to facilitate permeation. Transmucosal 
administration may be through nasal sprays or using suppositories. For oral 
10 administration, the oligomers are formulated into conventional oral administration 
forms such as capsules, tablets, and tonics. For topical administration, the oligomers 
of the application are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

15 The application now being generally described, it will be more readily 

understood by reference to the following examples, which are included merely for 
purposes of illustration of certain aspects and embodiments of the present 
application, and are not intended to limit the application. 

20 EXAMPLES 

Example 1 . Role of POSH in virus-like particle CVJJP) budding 
1. Objective: 

Use RNAi to inhibit POSH gene expression and compare the efficiency of 
viral budding and GAG expression and processing in treated and untreated cells. 
25 2. Study Plan: 

HeLa SS-6 cells are transfected with mRNA-specific RNAi in order to 
knockdown the target proteins. Since maximal reduction of target protein by RNAi 
is achieved after 48 hours, cells are transfected twice - first to reduce target mRNAs, 
and subsequently to express the viral Gag protein. The second transfection is 
30 performed with pNLenv (plasmid that encodes HIV) and with low amounts of RNAi 
to maintain the knockdown of target protein during the time of gag expression and 
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budding of VLPs. Reduction in mRNA levels due to RNAi effect is verified by RT- 
PCR amplification of target mRNA. 
3. Methods, Materials, Solutions 

a. Methods 

5 i. Transfections according to manufacturer's protocol and as described in 

procedure. 

ii. Protein determined by Bradford assay. 

iii SDS-PAGE in Hoeffer miniVE electrophoresis system. Transfer in Bio- 
Rad mini-protean II wet transfer system. Blots visualized using Typhoon system, 
1 0 and ImageQuant software (ABbiotech) 

b. Materials 



Material 


Manufacturer 


Catalog # 


Batch # 


Lipofectamine 2000 
(LF2000) 


Life Technologies 


11668-019 


1112496 


OptiMEM 


Life Technologies 


31985-047 


3063119 


RNAi Lamin A/C 


Self 


13 




RNAi TSG101 688 


Self 


65 




RNAi Posh 524 


Self 


81 




plenvll PTAP 


Self 


148 




plenvll ATAP 


Self 


149 




Anti-p24 polyclonal 
antibody 


Seramun 




A-0236/5- 
10-01 


Anti-Rabbit Cy5 
conjugated antibody 


Jackson 


144-175-115 


48715 


10% acrylamide Tris- 
Glycine SDS-PAGE gel 


Life Technologies 


NP0321 


1081371 


Nitrocellulose 
membrane 


Schleicher & 
Schuell 


401353 


BA-83 


NuP AGE 20X transfer 
buffer 


Life Technologies 


NP0006-1 


224365 


0.45 um filter 


Schleicher & 


10462100 


CS1018-1 
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Schuell . 







c. Solutions 



Lysis Buffer 


Compound 


Concentration 


Tris-HClpH7.6 


50mM 


MgCl 2 


15mM 


NaCl 


150mM 


Glycerol 


10% 


EDTA 


ImM 


EGTA 


ImM 




ASB-14 (add immediately 
before use) 


1% 


6X Sample 
Buffer 


Tris-HCl, pH=6.8 


1M 


Glycerol 


30% 


SDS 


10% 


DTT 


9.3% 


Bromophenol Blue 


0.012% 


TBS-T 


Tris pH=7.6 


20mM 


NaCl 


137mM 


Tween-20 


0.1% 



4. Procedure 
5 a. Schedule 



Day 


1 


2 


3 


• 4 


5 


Plate 


Transfection 


Passage 


Transfection II 


Extract RNA 


cells 


I 


cells 


(RNAi and 


for RT-PCR 




(RNAi only) 


(1:3) 


pNlenv) 


(post 








(12:00, PM) 


transfection) 
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Extract RNA for 

RT-PCR 

(pre-transfection) 


Harvest VLPs 
and cells 



b. Dayl 

Plate HeLa SS-6 cells in 6-well plates (35mm wells) at concentration of 5 XI 0 5 
cells/well. 

c. Day 2 

2 hours before transfection replace growth medium with 2 ml growth medium 
without antibiotics. 

Transfection I: 

RNAi A B 
RNAi [nM] [20mM] OPtiMEM LF2000 mix 

Reaction RNAi name TAGDA# Reactions pi (pi) (MO 



1 


Lamin A/C 


13 


2 


50 


12.5 


500 500 


2 


Lamin A/C 


13 


1 


50 


6.25 


250 250 


3 


TSG101 688 


65 


2 


20 


5 


500 500 


5 


Posh 524 


81 


2 


50 


12.5 


500 500 



10 Transfections: 

Prepare LF2000 mix: 250 ^1 OptiMEM + 5 nl LF2000 for each reaction. Mix by 
inversion, 5 times. Incubate 5 minutes at room temperature. 
Prepare RNA dilution in OptiMEM (Table 1, column A). Add LF2000 mix 
dropwise to diluted RNA (Table 1, column B). Mix by gentle vortex. Incubate at 

15 room temperature 25 minutes, covered with aluminum foil. 

Add 500 jlxI transfection mixture to cells dropwise and mix by rocking side to 
side. 

Incubate overnight. 

d. Day 3 

20 Split 1 :3 after 24 hours. (Plate 4 wells for each reaction, except reaction 2 which 

is plated into 3 wells.) 

e. Day 4 
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2 hours pre-transfection replace medium with DMEM growth medium without 
antibiotics. 



Transfection II 



B 



RNAi 

Plasmid [20pM] for 

RNAi TAG Reaction for 2.4 pg 10nM OPtiMEM LF2000 mix 

name DA# Plasmid # (mO (mO (pi) (Ml) 
Lamin 3.4 

A/C 13 PTAP 3 3.75 750 750 
Lamin 2.5 

A/C 13 ATAP 3 3.75 750 750 
TSG101 3.4 

688 65 PTAP 3 3.75 750 750 

Posh 524 81 PTAP 3 3.4 3.75 < 750 750 

5 Prepare LF2000 mix: 250 \i\ OptiMEM + 5 \ii LF2000 for each reaction. Mix by 

inversion, 5 times. Incubate 5 minutes at room temperature. 
Prepare RNA+DNA diluted in OptiMEM (Transfection II, A+B+C) 
Add LF2000 mix (Transfection II, D) to diluted RNA+DNA dropwise, mix by 
gentle vortex, and incubate Ih while protected from light with aluminum foil. 
10 Add LF2000 and DNA+RNA to cells, 500^1/well, mix by gentle rocking and 

incubate overnight. 

f. Day 5 

Collect samples for VLP assay (approximately 24 hours post-transfection) by the 
following procedure (cells from one well from each sample is taken for RNA 
1 5 assay, by RT-PCR). 

g. Cell Extracts 

i. Pellet floating cells by centrifugation (5min, 3000 rpm at 4 °C), save 
supernatant (continue with supernatant immediately to step h), scrape 
remaining cells in the medium which remains in the well, add to the 
20 corresponding floating cell pellet and centrifuge for 5 minutes, 1800rpm at 

4°C. 
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ii. Wash cell pellet twice with ice-cold PBS. 

iii. Resuspend cell pellet in 100 jxl lysis buffer and incubate 20 minutes on 
ice. 

iv. Centrifuge at 14,000 rpm for 15 min. Transfer supernatant to a clean 
5 tube. This is the cell extract. 

v. Prepare 10 jxl of cell extract samples for SDS-PAGE by adding SDS- 
PAGE sample buffer to IX, and boiling for 10 minutes. Remove an aliquot 
of the remaining sample for protein determination to verify total initial 
starting material. Save remaining cell extract at -80 °C 

10 h. Purification of VLPs from cell media 

i. Filter the supernatant from step g through a 0.45m filter. . 

ii. Centrifuge supernatant at 14,000 rpm at 4 °C for at least 2 h. 

iii. Aspirate supernatant carefully. 

iv. Re-suspend VLP pellet in hot (100 °C warmed for 10 min at least) IX 
15 sample buffer. 

v. Boil samples for 10 minutes, 100 °C. 
i. Western Blot analysis 

i. Rim all samples from stages A and B on Tris-Glycine SDS-PAGE 10% 
(120V for 1.5 h). 

20 ii. Transfer samples to nitrocellulose membrane (65V for 1.5 h). 

iii. Stain membrane with ponceau S solution. 

iv. Block with 10% low fat milk in TBS-T for 1 h. 

v. Incubate with anti p24 rabbit 1 :500 in TBS-T o/n. 

vi. Wash 3 times with TBS-T for 7 min each wash. 

25 vii. Incubate with secondary antibody anti rabbit cy5 1 :500 for 30 min. 

viii. Wash five times for 10 min in TBS-T. 

ix. View in Typhoon gel imaging system (Molecular Dynamics/APBiotech) 
for fluorescence signal. 

Results are shown in Figures 11-13. 



30 



Example 2. Exemplary POSH RT-PCR primers and siRNA duplexes 
RT-PCR primers 
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Name 


Position 


Sequence 


Sense primer 


POSH=271 


271 


5 1 CTTGCCTTGCCAGCATAC 3* (SEQ ID NO: 12) 


Anti-sense 
primer 


POSH=926c 


926C 


5* CTGCCAGCATTCCTTCAG 3' (SEQIDNO:13) j 



siRNA duplexes: 

siRNA No: 
siRNA Name: 
Position in mRNA 
Target sequence: 
siRNA sense strand: 
siRNA anti-sense strand: 



153 

POSH-230 
426-446 

5' AACAGAGGCCTTGGAAACCTG 3' 

5' dTdTCAGAGGCCUUGGAAACCUG 3' 

5'dTdTCAGGUUUCCAAGGCCUCUG 3' 



SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 



10 siRNA No: 
siRNA Name: 
Position in mRNA 
Target sequence: 
siRNA sense strand: 

15 siRNA anti-sense strand: 



155 

POSH-442 
638-658 

5' AAAGAGCCTGGAGACCTTAAA 3' 

5' ddTdTAGAGCCUGGAGACCUUAAA 3' 

5' ddTdTUUUAAGGUCUCCAGGCUCU 3' 



SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 



20 



siRNA No: 
siRNA Name: 
Position in mRNA 
Target sequence: 
siRNA sense strand: 
siRNA anti-sense strand: 



157 

POSH-U111 
2973-2993 

5' AAGGATTGGTATGTGACTCTG 3' 
5' dTdTGGAUUGGUAUGUGACUCUG 3' 
5' dTdTCAGAGUCACAUACCAAUCC 3' 



SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 



siRNA No: 
25 siRNA Name: 

Position in mRNA 
Target sequence: 
siRNA sense strand: 



159 

POSH-U410 
3272-3292 

5' AAGCTGGATTATCTCCTGTTG 3' 
5' ddTdTGCUGGAUUAUCUCCUGUUG 3' 



SEQ ID NO: 
SEQ ID NO: 
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siRNA anti-sense strand: 5' ddTdTCAACAGGAG AUAAUCCAGC 3' SEQ ID NO: 

siRNA No.: 187 
siRNA Name: POSH-control 
5 Position in mRNA: None. Reverse to #153 

Target sequence: 5' AAGTCCAAAGGTTCCGGAGAC 3 9 SEQ ID 

NO: 36 

3. Knock-down of hPOSH entraps HIV virus particles in intracellular vesicles. 

10 HIV virus release was analyzed by electron microscopy following siRNA 

and full-length HIV plasmid (missing the envelope coding region) transfection. 
Mature viruses were secreted by cells transfected with HIV plasmid and non- 
relevant siRNA (control, lower panel). Knockdown of TsglOl protein resulted in a 
budding defect, the viruses that were released had an immature phenotype (upper 

15 panel). Knockdown of hPOSH levels resulted in accumulation of viruses inside the 
cell in intracellular vesicles (middle panel). Results, shown in Figure 28, indicate 
that inhibiting hPOSH entraps HIV virus particles in intracellular vesicles. As 
accumulation of HIV virus particles in the cells accelerate cell death, inhibition of 
hPOSH therefore destroys HIV reservoir by killing cells infected with HIV. 

20 

Example 4. In-vitro assay of Human POSH self-ubiquitination 

Recombinant hPOSH was incubated with ATP in the presence of El, E2 and 

ubiquitin as indicated in each lane. Following incubation at 37 °C for 30 minutes, 
25 reactions w ere t erminated b y a ddition o f S DS-PAGE s ample b uffer. T he s amples 

were subsequently resolved on a 10% polyacrylamide g el. The separated samples 

were then transferred to nitrocellulose and subjected to immunoblot analysis with an 

anti ubiquitin polyclonal antibody. The position of migration of molecular weight 

markers is indicated on the right. 
30 Poly-Ub: Ub-hPOSHconjugates, detected as high molecular weight adducts only in 

reactions containing El, E2 and ubiquitin. hPOSH-176 and hPOSH-178 are a short 
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and a longer derivatives (respectively) of bacterially expressed hPOSH; C, 
E3. 

Preliminary steps in a high-throughput screen 
Materials 

5 1 . El recombinant from bacculovirus 

2. E2 Ubch5c from bacteria 

3. Ubiquitin 

4. POSH #178 (1-361) GST fusion-purified but degraded 

5. POSH #176 (1-269) GST fusion-purified but degraded 
10 6. hsHRD 1 soluble ring containing region 

5. Bufferxl2 (Tris 7.6 40 mM, DTT ImM, MgCl 2 5mM, ATP 2uM) 

6. Dilution buffer (Tris 7.6 40mM, DTT ImM, ovalbumin lug/ul) 



protocol 





0. lug/ul 


0.5ug/ul 


5ug/ul 


0.4ug/ul 


2.5ug/u/ 


0.8ug/ul 






El 


E2 


Ub 


176 


178 


Hrdl 


Bxl2 


-El (E2+176) 




0.5 


0.5 








10 


-E2 (El+176) 


1 




0.5 








9.5 .. 


-ub (E1+E2+176) 


1 


0.5 










9.5 ; 


El+E2+176+Ub 


1 


0.5 


0.5 








9 


-El (E2+178) 




0.5 


0.5 




1 




10 


-E2 (El+178) 


1 




0.5 




1 




9.5 


-ub (E1+E2+178) 


1 


0.5 






1 




9.5 


El+E2+178+Ub 


1 


0.5 


0.5 




1 




9 


1 Hrdl, El+E2+Ub 


1 


0.5 


0.5 






1 


8.5 

* 



15 1 . Incubate for 3 0 minutes at 37 °C. 

2. Run 12% SDS PAGE gel and transfer to nitrocellulose membrane 

3. Incubate with anti-Ubiquitin antibody. 

Results, shown in Figure 19, demonstrate that human POSH has 
ubiquitin ligase activity. 

20 
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Example 5. Co-immunoprecipitation of hPOSH with mvc-tagged activated CV12) 
and dominant-negative fN17) Racl 

HeLa cells were transfected with combinations of myc-Racl VI 2 or Nl 7 and 
hPOSHdelRING-V5. 24 hours after transfection (efficiency 80% as measured by 
5 GFP) cells were collected, washed with PBS, and swollen in hypotonic lysis buffer 
(10 mM HEPES pH=7.9, 15 mM KC1, 0.1 mM EDTA, 2 mM MgC12, 1 mM DTT, 
and protease inhibitors). Cells were lysed by 10 strokes with dounce homogenizer 
and centrifuged 3000xg for 10 minutes to give supernatant (Fraction 1) and nucleii. 
Nucleii were washed with Fraction 2 buffer (0.2% NP-40, 10 mM HEPES pH=7.9, 

10 40 mM KC1, 5% glycerol) to remove peripheral proteins. Nucleii were spun-down 
and supernatant collected (Fraction 2). Nuclear proteins were eluted in Fraction 3 
buffer (20 mM HEPES pH=7.9, 0.42 M KC1, 25% glycerol, 0.1 mM EDTA, 2 mM 
MgCk, 1 mM DTT) by rotating 30 minutes in cold. Insoluble proteins were spun- 
down 14000xg and solubilized in Fraction 4 buffer (1% Fos-Choline 14, 50 mM 

15 HEPES pH=7.9, 150 mM NaCl, 10% glycerol, ImM EDTA, 1.5 mM MgCl 2 , 2 mM 
DTT). Half of the total extract was pre-cleared against Protein A sepharose for 1.5 
hours and used for IP with 1 ^g anti-myc (9E10, Roche 1-667-149) and Protein A 
sepharose for 2 hours. Immune complexes were washed extensively, and eluted in 
SDS-PAGE sample buffer. Gels were run, and proteins electro-transferred to 

20 nitrocellulose for immunoblot as in Figure 20. Endogenous POSH and transfected 
hPOSHdelRING-V5 are precipitated as a complex with Myc-Racl V12/N17. 
Results, shown in Figure 2 0, demonstrate that POSH c o-immunoprecipitates with 
Racl. 

25 Example 6. POSH reduction results in decreased secretion of phospholipase D 
(PLD) 

Hela SS6 cells (two wells of 6-well plate) were transfected with POSH 
siRNA or control siRNA (100 nM). 24 hours later each well was split into 5 wells of 
30 a 24- well plate. The next day cells were transfected again with 100 nM of either 
POSH siRNA or control siRNA. The next day cells were washed three times with 
IxPBS and than 0.5 ml of PLD incubation buffer (118 mM NaCl, 6 mM KC1, 1 mM 
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CaCl 2 , 1.2 raM MgS04, 12.4 mM HEPES, pH7.5 and 1% fatty acid free bovine 
serum albumin) were added. 

48 hours later medium was collected and centrifuged at 800xg for 15 
minutes. The medium was diluted with 5xPLD reaction buffer (Amplex red PLD kit) 

5 and assayed for PLD by using the Amplex Red PLD kit (Molecular probes, A- 
12219). The assay results were quantified and presented below in as a bar graph. 
The cells were collected and lysed in 1% Triton X-100 lysis buffer (20 mM HEPES- 
NaOH, pH 7.4, 150 mM NaCl, 1.5 mM MgCl 2 , 1 mM EDTA, 1% Triton X-100 and 
lx protease inhibitors) for 15 minutes on ice. Lysates were cleared by centrifugation 

10 and protein concentration was determined. There were equal protein concentrations 
between the two transfectants. Equal amount of extracts were immunoprecipitated 
with anti-POSH antibodies, separated by SDS-PAGE and immunoblotted with anti- 
POSH antibodies to assess the reduction of POSH levels. There was approximately 
40% reduction in POSH levels (Figure 21). 

15 

Example 7. Effect of hPOSH on Gae-EGFP intracellular distribution 

HeLa SS6 were transfected with Gag-EGFP, 24 hours after an initial 
transfection with either hPOSH-specific or scrambled siRNA (control) (lOOnM) or 
with plasmids encoding either wild type hPOSH or hPOSH C(12,55)A. Fixation 

20 and staining was preformed 5 hours after Gag-EGFP transfection. Cells were fixed, 
stained with Alexa fluor 647-conjugated Concanavalin A (ConA) (Molecular 
Probes), permeabilized and then stained with sheep anti-human TGN46. After the 
primary antibody incubation cells were incubated with Rhodamin-conjugated goat 
anti-sheep. Laser scanning confocal microscopy was performed on LSM510 

25 confocal microscope (Zeiss) equipped with Axiovert 100M inverted microscope 
using x40 magnification and 1.3-numerical-aperture oil-immersion lens for imaging. 
For co-localization experiments, 10 optical horizontal sections with i ntervals of 1 
jim were taken through each preparation (Z-stack). A single median section of each 
preparation is shown. See Figure 22. 

30 

Example 8. POSH-Regulated Intracellular Transport of Mvristovlated Proteins 

-134- 



WO 2004/078130 



PCT/US2004/006308 



The localization of myristoylated proteins, Gag (see Figure 22), HIV-1 Nef, 
Src and Rapsyn, in cells depleted of hPOSH were analyzed by immunofluorescence. 
In control cells, HTV-1 Nef was found in a perinuclear region co-localized with 
hPOSH, indicative of a TGN localization (Figure 23). When hPOSH expression was 
5 reduced by siRNA treatment, Nef expression was weaker relative to control and nef 
lost its TGN, perinuclear localization. Instead it accumulated in punctated 
intracellular loci segregated from the TGN. 

Src is expressed at the plasma membrane and in intracellular vesicles, which 
are found close to the plasma membrane (Figure 24, HI 87 cells). However, when 

10 hPOSH levels were reduced, Src was dispersed in the cytoplasm and loses its plasma 
membrane proximal localization detected in control (HI 87) cells (Figure 24, 
compare HI 53-1 and HI 87-2 panels). 

Rapsyn, a peripheral membrane protein expressed in skeletal muscle, plays a 
critical role in organizing the structure of the nicotinic postsynaptic membrane 

15 (Sanes and Lichtman, Annu. Rev. Neurosci. 22: 389-442 (1999)). Newly 

synthesized Rapsyn associates with the TGN and than transported to the plasma 
membrane (Marchand et al., J. Neurosci. 22: 8891-01 (2002)). In hPOSH-depleted 
cells (H153-1) Rapsyn was dispersed in the cytoplasm, while in control cells it had a 
punctuated pattern and plasma membrane localization, indicating that hPOSH 

20 influences its intracellular transport (Figure 25). 

Materials and Methods Used: 
• Antibodies: 

Src antibody was purchased from Oncogene research products( Darmstadt, 
25 Germany). Nef antibodies were pusrchased from ABI (Columbia, MA) and 
Fitzgerald Industries Interantional (Concord, MA). Alexa Fluor conjugated 
antibodies were pusrchased from Molecular Probes Inc. (Eugene, OR). 

hPOSH antibody: Glutathione S-transferase (GST) fusion plasmids were 
constructed by PCR amplification of hPOSH codons 285-430. The amplified PCR 
30 products was cloned into pGEX-6P-2 (Amersham Pharmacia Biotech, 

Buckinghamshire, UK). The truncated hPOSH protein was generated in E. coli 
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BL21 . Bacterial cultures were grown in LB media with carbenicillin (100 ng/ml) 
and recombinant protein production was induced with 1 mM IPTG for 4 hours at 30 
°C. Cells were lysed by sonication and the recombinant protein was then isolated 
from the cleared bacterial lysate by affinity chromatography on a glutathione- 
5 sepharose resin (Amersham Pharmacia Biotech, Buckinghamshire, UK). The 
hPOSH portion of the fusion protein was then released by incubation with 
PreScission protease (Amersham Pharmacia Biotech, Buckinghamshire, UK) 
according to the manufacturer's instructions and the GST portion was then removed 
by a second glutathione-sepharose affinity chromatography. The purified partial 
10 hPOSH polypeptide was used to immunize New Zealand white rabbits to generate 
antibody 15B (Washington Biotechnology, Baltimore, Maryland). 

• Construction of siRNA retroviral vectors: 

hPOSH scrambled oligonucleotide (5'- CACACACTGCCG TCAACT 
GTTCAAGAGAC AGTTGACGGCAGTGTGTGTTTTTT -3'; and 5'- 

1 5 AATT AAAAAAC AC A C ACTGCCGTC AACTGTC TCTTGAAC AGTTGA 
CGGCAGTGTGTGGGCC -3') were annealed and cloned into the Apal-EcoRI 
digested pSilencer 1.0-US (Ambion) to generate pSIL-scrambled. Subsequently, the 
U6-promoter and RNAi sequences were digested with BamHI, the ends filled in and 
the insert cloned into the Olil site in the retroviral vector, pMSVhyg (Clontech), 

20 generating pMSCVhyg-U6-scrambled. hPOSH oligonucleotide encoding RNAi 

against hPOSH (5 '-AACAGAGGCCTTGGAAA CCTGGAAGC TTGCAGGTTT 
CCAAGGCCTCTGTT -3'; and 5'- GATCAACAGAG GCCTTGGAAACCTGC 
AAGCTTCCAGGTTTCCAA GGCCTCTGTT -3') were annealed and cloned into 
the BamHI-EcoRI site of pLIT-U6, generating pLIT-U6 hPOSH-230. pLIT-U6 is an 

25 shRNA vector containing the human U6 promoter (amplified by PCR from human 
genomic DNA with the primers, S'-GGCCCACTAGTCA AGGTCG GGCA 
GGAAGA- 3' and 5'- GCCGAATT CAAAAAGGATC CGGCGATATCCGG 
TGTTTCGTCCTTTCCA -3') cloned into pLITMUS38 (New England Biolabs) 
digested with Spel-EcoRI. Subsequently, the U6 promoter-hPOSH shRNA (pLIT- 

30 U6 hPOSH-230 digested with SnaBI and Pvul) was cloned into the Olil site of 
pMSVhyg (Clontech), generating pMSCVhyg U6-hPOSH-230. 
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• Generation of stable clones: 

HEK 293T cells were transfected with retroviral RNAi plasmids 
(pMSCVhyg-U6-POSH-230 and pMSCVhyg-U6-scrambled and with plasmids 
encoding VSV-G and moloney gag-pol. Two days post transfection, medium 

5 containing retroviruses was collected and filtered and polybrene was added to a final 
concentration of 8|ag/ml. This was used to infect HeLa SS6 cells grown in 60 mm 
dishes. Forty-eight hours post-infection cells were selected for RNAi expression by 
the addition of hygromycin to a final concentration of 300 fig/ml. Clones expressing 
RNAi against hPOSH were named H153, clones expressing scrambled RNAi were 

10 named HI 87. 

• Transfection and immunofluorescent analysis: 
Gag-EGFP experiments are described in Figure 22. 

HI 53 or HI 87 cells were transfected with Src or Rapsyn-GFP (Image clone 
image: 3530551 or pNLenv-1). Eighteen hours post transfection cells were washed 

1 5 with P BS a nd i ncubated o n i ce with A lexa F luor 6 47 c onjugated C on A t o 1 abel 
plasma membrane glycoproteins. Subsequently cells were fixed in 3% 
paraformaldehyde, blocked with PBS containing 4% bovine serum albumin and 1% 
gelatin. Staining with rabbit anti-Src, rabbit anti-hPOSH (15B) or mouse anti-nef 
was followed with secondary antibodies as indicated. 

20 Laser scanning confooal microscopy was performed on LSM510 confocal 

microscope (Zeiss) equipped with Axiovert 100M inverted microscope using x40 
magnification and L3-numerical-aperture oil-immersion lens for imaging. For co- 
localization experiments, 10 optical horizontal sections with intervals of 1 yon were 
taken through each preparation (Z-stack). A single median section of each 

25 preparation is shown. 

Exam ple 9. POSH Reduction bv siRNA Abrogates West Nile Virus ("WNV") 
Infectivitv. 

HeLa SS6 cells were transfected with either control or POSH-specific 
30 siRNA. Cells were subsequently infected with WNV (4xl0 4 PFU/well). Viruses 
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were harvested 24 hours and 48 hours post-infection, serially diluted, and used to 
infect Vero cells. As a control WNV (4x1 0 4 PFU/well), that was not passed through 
HeLa SS6 cells, was used to infect Vero cells. Virus titer was determined by plaque 
assay in Vero cells. 

5 Virus titer was reduced by 2.5-log in cells treated with POSH-specific 

siRNA relative to cells transfected with control siRNA, thereby indicating that 
WNV requires POSH for virus secretion. See Figure 26. 

Experimental Procedure: 

10 • Cell culture, transfections and infection: 

Hela SS6 cells were grown in Dulbecco's modified Eagle's medium 
(DMEM) supplemented with 10% heat-inactivated fetal calf serum and 100 units/ml 
penicillin and 100 jag/ml streptomycin. For transfections, HeLa SS6 cells were 
grown to 50% confluency in DMEM containing 10% FCS without antibiotics. Cells 

1 5 were then transfected with the relevant double-stranded siRNA (1 00 nM) using 
lipofectamin 2000 (Invitrogen, Paisley, UK). On the day following the initial 
transfection, cells were split 1:3 in complete medium and transfected with a second 
portion of double-stranded siRNA (50 nM). Six hours post-transfection medium was 
replaced and cells infected with WNV (4x1 0 4 PFU/well). Medium was collected 

20 from infected HeLa SS6 cells twenty-four and forty-eight post-infection (200 jal), 
serially diluted, and used to infect Vero cells. Virus titer was determined by plaque 
assay (Ben-Nathan D, Lachmi B, Lustig S, Feuerstien G (1991) Protection of 
dehydroepiandrosterone (DHEA) in mice ifected with viral encephalitis. Arch Viro; 
120, 263-271). 

25 

Exam ple 10. Analysis of the effects of POSH knockdown on M-Mu LV expression 
and budding 
Experimental Protocol: 
Transfections:- 

30 A day before transfection, Hela SS6 cells were plated in two 6 wells plates at 

5 xlO 5 cells per well. 24 hours later the following transfections were performed: 
4 wells were transfected with control siRNA and a plasmid encoding MMuLV. 
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4 wells were transfected with POSH siRNA and a plasmid encoding MMuLV. 
1 well was a control without any siRNA or DNA transfected. 
1 well was transfected with a plasmid encoding MMuLV. 

For each well to be transfected 100 nM (12.5 ui) POSH siRNA or 100 nM 

5 (12.5 \xl) control siRNA were diluted in 250 ^1 Opti-MEM (Invitrogen). 

Lipofectamin 2000 (5 pJ) (Invitrogen, Cat. 1 1668-019) was mixed with 250 jxl of 
OptiMEM per transfected well. The diluted siRNA was mixed with the lipofectamin 
2000 mix and the solution incubated at room temperature for 30 min. The mixture 
was added directly to each well containing 2 ml DMEM +10% FBS (w/o 

10 antibiotics). 

24 hours later, four wells of the same siRNA treatment were split to eight 
wells, and two wells without siRNA were split to four wells. 

24 hours later all wells were transfected with 100 nM control siRNA or 100 
nM POSH siRNA with or without a plasmid encoding MMuLV (see table below). 
15 48 hours later virions and cells were harvested. 



No of 
wells 


RNAi 


Amount 
of RNAi 
Oil) per 
well 


Amount of DNA 
(ug) per well 


The volume of 
DNA (pi) per 
well 


Application 


5 


POSH 
100 nM (1 st 
and 2 nd 
transfection) 


12.5 


MMuLV (2 ug) 


10 


4 wells for 
VLPs assay 
and 1 well for 
RT 


5 


Control 
100 nM (1 st 
and 2 nd 
transfection) 


12.5 


MMuLV (2 ug) 


10 


4 wells for 
VLPs assay 
and 1 well for 
RT 


1 








10 ul H 2 0 


VLPs assay 


1 






MMuLV (2 ug) 


10 


VLPs assay 



Steady state VLP assay 
Cell extracts:- 

20 L Pellet floating cells by centrifugation (10 min, 500xg at 4 °C), save 

supernatant (continued at step 7), wash cells once, scrape cells in ice-cold 
lxPBS, add to the corresponding cell pellet and centrifuge for 5 min 1800 
rpm at 4 °C. 
2. Wash cell pellet once with ice-cold lxPBS. 
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3. Resuspend cell pellet in 150 pJ 1% Triton X-100 lysis buffer (20 mM 
HEPES-NaOH, pH 7.4, 150 mM NaCl, 1.5 mM MgCl 2 , 1 mM EDTA, 1% 
Triton X-100 and lx protease inhibitors) and incubate 20 minutes on ice. 

4. Centrifuge at 14,000rpm for 15 min. Transfer supernatant to a clean tube. 
5 5. Determine protein concentration by BCA. 

6. Prepare samples for SDS-PAGE by adding 2 |il of 6xSB to 20 jig extract 
(add lysis buffer to a final volume of 12 pi), heat to 80 °C for 10 min. 

Purification of virions from cell media 
10 7. Filtrate the supernatant through a 0.45 |im filter. 

8. Transfer 1500 jil of virions fraction to an ultracentrifuge tube (swinging 
rotor). 

9. Add 300 jil of fresh sucrose cushion (20% sucrose in TNE) to the bottom of 
the tube. 

15 10. Centrifuge supernatant at 35000 rpm at 4 °C for 2 hr. 

11. Resuspend virion pellet in 50 \xl hot lx sample buffer each (samples 153-1, 
2, 3, 187-1, 2, 3). Resuspend VLPs pellet (153-4, 5 and 1 87 4, 5) in 25 p.1 hot 
lx sample buffer. Vortex shortly, transfer to an eppendorf tube, unite VLPs 
from wells 153-4+5 and 187- 4+5. Heat to 80 °C for 10 min. 
20 12. Load equal amounts of VLPs relatively to cells extracts amounts. 

Western Blot analysis 

1. Separate all samples on 12% SDS-PAGE. 

2. Transfer samples to nitrocellulose membrane (1 00V for 1 . 1 5 hr). 

3. Dye membrane with ponceau solution. 

25 4. Block with 10% low fat milk in TBS-T for 1 hour. 

5. Incubate membranes with Goat anti p30 (81S-263) (1 :5000) in 10% low fat 
milk in TBS-T over night at 4 °C. Incubate with secondary antibody rabbit 
anti goat-HRP 1 :8000 for 60 min at room temperature. 

6. Detect signal by ECL reaction. 

30 7. Following the ECL detection incubate memebranes with Donkey anti rabbit 

Cy3 (Jackson Laboratories, Cat 71 1-165-152) 1:500 and detect signal by 
Typhoon scanning and quantitate. 
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Results: 

As shown in Figure 27, POSH knockdown decreases the release of 
extracellular MMuLV particles. 

5 

F.xam ple 1 1 . POSH Protein-protein interactions bv veast tw o hybrid assay 

POSH-associated proteins were identified by using a yeast two-hybrid assay. 
Procedure: 

Bait plasmid (GAL4-BD) was transformed into yeast strain AH109 
10 (Clontech) and transformants were selected on defined media lacking tryptophan. 
Yeast strain Yl 87 containing pre-transformed Hela cDNA prey (GAL4-AD) library 
(Clontech) was mated according to the Clontech protocol with bait containing yeast 
and plated on defined media lacking tryptophan, leucine, histidine and containing 2 
mM 3 amino triazol. Colonies that grew on the selective media were tested for beta- 
1 5 galactosidase activity and positive clones were further characterized. Prey clones 
were identified by amplifying cDNA insert and sequencing using vector derived 
primers. 
Bait: 

Plasmid vector: pGBK-T7 (Clontech) 
20 Plasmid name: pPL269- pGBK-T7 GAL4 POSHdR 

Protein sequence: Corresponds to aa 53-888 of POSH (RING domain deleted) 

RTLVGSGVEELPSNILLVRIiLDGIKQRPWKPGPGGGSGTNCTNALRSQSSTVANCSSKDL 
QSSQGGQQPRVQSWSPPVRGI PQIiPCAKALYNYEGKEPGDIjKFSKGDI I ILRRQVDENWY 
HGEVNGIHGFFPTNFVQIIKPLPQPPPQCKAIjYDFEVKDKEADKDCLPFAKDDVL.TVIRR 
25 VDENWAEGMLADKIGIFPISYVEFNSAAKQLIEWDKPPVPGVDAGECSSAAAQSSTAP^ 
SDTKKNTKKRHSFTSLTMA^ 

APSQVHISTTGLIVTPPPSSPVTTGPSFTFPSDVPYQAALGTIiNPPLPPPPLLAATVIiAS 
TPPGATAAAAAAGMGPRPMAGSTMIAHLRPQTRPSVYVAIYPYTPRKEDELELJIKGEME 
LVFERCQDGWFKGTSMHTSKIGVFPGNYVAPVT^ 

30 PSTAGGPAQKLQGNGVAGS PS WPAAWSAAHI QTS PQAKVLLHMTGQMTVNQARNAVRT 
VAAHNQERPTAAVTP IQVQNAAGLS PASVGLSHHSLAS PQPAPLMPGSATHTAAI S I SRA 
SAPLiACAAAAPLTSPSITSASLEAEPSGRIVTVLPGLPTSPDSASSACGNSSATKPDKDS 
KKEKKGLLKLLSGASTKRKPRVSPPASPTLEVELGSAELPIiQGAVGPELPPGGGHGRAGS 
CPVDGDGPVTTAVAGAALAQDAFHRKASSLDSAVPIAPPPRQACSSLGPVLNESRPWCE 

35 RHRVWSYPPQSEAELELKEGDIVFVHKKREDGWFKGTLQRNGKTGLFPGSFVENI 
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Library screened: Hela pretransformed library (Clontech). 

POSH-APs identified by yeast two-hybrid assay are provided in Tables 7 and 
8. Also, the nucleic acid and amino acid sequences of POSH-APs identified by 
yeast two-hybrid assay are provided in Figure 36. In addition, the nucleic acid and 
5 amino acid sequences of ARF1 and ARF5 are provided in Figure 36. 

Example 12. Inhibition of PKA Kinase Activity Attenuates HIV-1 Virus Maturation 
HeLa SS6 cells were transfected with pNLenv-lpTAP or pNLenv-l A TAA (L- 
domain mutant). Eighteen hours post-transfection, cells were transferred to 20 °C for 

10 two hours in order to inhibit transport of viral particles from the /rajw-Golgi (TGN) 
to the plasma membrane (PM). Subsequently, the PKA inhibitor, H89 (50 jiM) 
(Biosource, Cat. No. PHZ11 14) or DMSO were added to the cells and dishes were 
transferred to 37 °C to initiate transport from the TGN to the PM. Reverse 
transcriptase activity was assayed from virus-like-particles collected from cell 

15 supernatant twenty minutes later. H89 treatment resulted in complete inhibition of 
RT activity. Thus, demonstrating that PKA activity is required for HIV-1 viral 
maturation. 

Materials and methods: 

20 Cell culture and transfections 

Hela SS6 cells were grown in Dulbecco's modified Eagle's medium 
(DMEM) supplemented with 10% heat-inactivated fetal calf serum and 100 units/ml 
penicillin and 100 ng/ml streptomycin. For transfections, HeLa SS6 cells were 
grown to 100% confluency in DMEM containing 10% FCS without antibiotics. 
25 Cells were then transfected with HIV-lNLenvi (2 j-ig per 6-well) (Schubert et al., 
1995). 

Assays for virus release by RT activity 

Virus and virus-like particle (VLP) release by reverse transcriptase activity 
was determined one day after transfection with the pro-viral DNA as previously 
30 described (Adachi et al., 1 986; Fukumori et al., 2000; Lenardo et al, 2002). The 
culture medium of virus-expressing cells was collected and centrifuged at 500 x g 
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for 10 minutes. The resulting supernatant was passed through a 0.45 nm-pore filter 
and the filtrate was centrifuged at 14,000 x g for 2 hours at 4 °C. The resulting 
supernatant was removed and the viral-pellet was re-suspended in cell solubilization 
buffer (50 mM Tris-HCl, pH7:8, 80 mM potassium chloride, 0.75 mM EDTA and 
5 0.5% Triton X-100, 2.5 mM DTT and protease inhibitors). The corresponding cells 
were washed three times with phosphate-buffered saline (PBS) and then solubilized 
by incubation on ice for 15 minutes in cell solubilization buffer. The cell detergent 
extract was then centrifuged for 15 minutes at 14,000 x g at 4 °C. The sample of the 
cleared extract (normally 1:10 of the initial sample) were resolved on a 12.5% SDS- 
10 polyacrylamide gel, then transferred onto nitrocellulose paper and subjected to 
immunoblot analysis with rabbit anti-CA antibodies. The CA was detected after • 
incubation with a secondary anti-rabbit antibody conjugated to Cy5 (Jackson 
Laboratories, West Grove, Pennsylvania) and detected by fluorescence imaging 
(Typhoon instrument, Molecular D ynamics, Sunnyvale, California). The Pr55 and 
1 5 CA were then quantified by densitometry. A colorimetric reverse transcriptase assay 
(Roche Diagnostics GmbH, Mannenheim, Germany) w as used to measure reverse 
transcriptase activity in VLP extracts. RT activity was normalized to amount of Pr55 
and CA produced in the cells. 

20 Example 13. hPOSH is phosphorvlated bv Protein kinase A (PKA) 

PKA is a cAMP-dependent kinase. The holoenzyme is a tetramer of two 
catalytic subunits (cPKA) bound to two regulatory subunits PRKRl or PRKR2. 
Activation proceeds by the cooperative binding of two cAMP molecules to each R 
subunit, which causes the dissociation of each active C subunit from the R subunit 

25 dimer. The consensus sequence for phosphorylation by the C subunit is, stringently, 
K/R-R-X-S/TY and less stringently, R-X-X-S/TY, where Y tends to be a 
hydrophobic r esidue. T he i ntracellular 1 ocalization o f P KA i s c ontrolled t borough 
association with A -kinase-anchoring proteins ( AKAPs). The regulatory subunit o f 
protein kinase A (PRKR1A) was identified as a POSH interactor by yeast-two- 

30 hybrid screen, thereby implicating POSH as an AKAP. 

Protein kinase A was demonstrated to be required for the budding of 
transport vesicles from the TGN (Muniz et al., 1997, Proc Natl Acad Sci USA, 
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94:14461-6). Furthermore, it was demonstrated that an inhibitor of PKA, H89, is 
able to block HIV-1 release from cells (Cartier et al., 2003, J Biol Chem., 
278:35211-9). Since POSH is localized at the TGN and is implicated as an AKAP, 
POSH may regulate PKA-mediated budding at the TGN of vesicles and HIV-1 . 

5 Applicants demonstrated that POSH is phosphbrylated by PKA. Several 

putative PKA phosphorylation sites are found within hPOSH coding sequence 
(Figure 30). Phosphorylation of gravin, an AKAP, by PKA modulates its binding to 
the b2-adrenergic receptor. This serves to regulate the mobilization of gravin and 
PKA to the cell membrane and regulation of b2-AR activity by PKA. Two putative 

10 PKA sites are located in the putative-rac-binding region in POSH. Toward this end, 
POSH was subjected to in-vitro phosphorylation and binding to the small GTPase 
Racl (Figure 31). Indeed, only unphosphorylated POSH was able to bind activated, 
GTP-loaded, Racl, demonstrating that phosphorylation regulates the binding of 
POSH to small GTPases, such as Racl. GTPases of this sort family include TCL, 

15 TC10, Cdc42, Wrch-1, Rac2, Rac3 or RhpG (Aspenstrom et al., 2003, Biochem J., 
377(Pt 2):327-37). Small GTPases of this sort are involved in protein trafficking in 
the secretory system, including the trafficking of viral proteins, such as those of 
HIV. 

Materials and methods 

20 PKA-dependent phosphorylation of hPOSH. 

Bacterially expressed recombinant maltose-binding-protein (MBP)-hPOSH 
(3 \ig) or GST-c-Cbl were incubated at 30oC for 30 minutes with (*) or without 10 
ng PKA catalytic subunit (PKAc) in a buffer containing 40 mM Tris-HCl pH 7.4, 10 
mM MgCl 2 , 4 mM ATP, 0.1 mg/ml BSA, 1 jiM cAMP, 23 mM K3PO4, 7 nM DTT, 
25 and PKA peptide protection solution (Promega, Cat.No. V5340). The reaction was 
stopped by the addition of SDS-sample buffer, and boiling for 3 minutes. Samples 
were separated by SDS-PAGE on a 10% gel, and transferred to nitrocellulose and 
immunoblotted as detailed in the figure. 

Binding of Racl to hPOSH 
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Bacterially expressed hPOSH (1 ^g) or GST (1 jig) were phosphorylated as 
above. The reaction was terminated by the addition 0.5 ml of ice-cold 200 mM Tris- 
HC1 pH 7.4, 5 mM EDTA. hPOSH and GST were then immobilized on NiNTA or 
reduced glutathione beads, respectively, by gentle mixing for 30 minutes. The 

5 immobilized proteins were washed three times with wash buffer (50 mM Tris-HCl 
pH 7.4, 100 mM NaCl, 5 mM MgC12, 0.1 mM DTT). Recombinant Rac-1 (0.2 \xg) 
(Sigma catalog # R3012) was incubated with or without 0.3 mM GTPyS (Sigma Cat. 
No. G8638) on ice for 15 minutes. The GTP/mock-loaded Rac-1 was then added to 
wash buffer (25 |xl, final) and incubated for 30 minutes at 30 °C. The beads were 

1 0 then washed three times with wash buffer containing 0. 1 % Tween 20. Sample buffer 
was added to the bead pellet and boiled for 3 minutes. Immobilized and associating 
proteins were then separated by SDS-PAGE on a 12% gel and immunobloted with 
anti-Rac-1 (Santa Cruz Biotechnology, Cat. No. sc-217). Input is 0.25 \ig of Rac-1. 

15 Example 14. HERPUD1 Depletion bv siRNA Reduces HIV Maturation. 

Hela SS6 cells were transfeted with siRNA directed against HERPUD1 and 
with a plsmid encoding HIV proviral genome (pNLenv-1). Twenty four hours post- 
HIV transfection, virus-like particles (VLP) secreted into the medium were isolated 
and reverse transcriptase activity was determined. HIV release of active RT is an 

20 indication for a release of processed and mature virus. When the levels of 

HERPUD1 were reduced RT activity was inhibited by 80%, demonstrating the 
importance of HERPUD1 in HIV-maturation. See Figure 33. 
Experimental Outline 

• Cell culture and transfection: 

25 HeLa SS6 were kindly provided by Dr. Thomas Tuschl (the laboratory of 

RNA Molecular Biology, Rockefeller University, New York, New York). Cells 
were grown in Dulbecco's modified Eagle's medium (DMEM) supplemented with 
10% heat-inactivated fetal calf serum and 100 U/ml penicillin and 100 ^g/ml 
streptomycin. For transfections, HeLa SS6 cells were grown to 50% confluency in 

30 DMEM containing 10% FCS without antibiotics. Cells were then transfected with 
the relevant double-stranded siRNA (50-100nM) (HERPUD1: 5'- 
GGGAAGUUCUUCGGAACCUdTdT-3 9 and 5'- 
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dTdTCCCUUCAAGAAGCCUUGGA-5 > ) using lipofectamin 2000 (Invitrogen, 
Paisley, UK). A day following the initial transfection cells were split 1:3 in complete 
medium and co-transfected 24 hours later with HIV-lNLenvl (2 \ig per 6-well) 
(Schubert et al., J. Virol. 72:2280-88 (1998)) and a second portion of double- 
5 stranded siKNA. 

• Assay for virus release 
Virus and virus-like particle (VLP) release was determined one day after 
transfection with the proviral DNA as previously described (Adachi et al., J. Virol. 
59: 284-91 (1986); Fukumori et al., Vpr. Microbes Infect. 2: 1011-17 (2000); 
10 Lenardo et al., J. Virol. 76: 5082-93 (2002)). The culture medium of virus- 
expressing cells was collected and centrifuged at 500 x g for 10 minutes. The 
resulting supernatant was passed through a 0.45p.m-pore filter and the filtrate was 
centrifuged at 14,000 x g for 2 hours at 4°C. The resulting supernatant was removed 
and the viral-pellet was re-suspended in SDS-PAGE sample buffer. The 
15 corresponding cells were washed three times with phosphate-buffered saline (PBS) 
and then solubilized by incubation on ice for 15 minutes in lysis buffer containing 
the following components: 50 mM HEPES-NaOH, (pH 7.5), 150 mM NaCl, 1.5 mM 
MgCl 2 , 0.5% NP-40, 0.5% sodium deoxycholate, 1 mM EDTA, 1 mM EGTA and 
1 :200 dilution of protease inhibitor cocktail (Calbiochem, La Jolla, California). The 
20 cell detergent extract was then centrifuged for 15 minutes at 14,000 x g at 4°C. The 
VLP sample and a sample of the cleared extract (normally 1 : 10 of the initial 
sample) were resolved on a 12.5% SDS-polyacrylamide gel, then transferred onto 
nitrocellulose paper and subjected to immunoblot analysis with rabbit anti-CA 
antibodies. The CA was detected either after incubation with a secondary anti-rabbit 
25 horseradish peroxidase-conjugated antibody and detected by Enhanced Chemi- 

Luminescence (ECL) (Amersham Pharmacia) or after incubation with a secondary 
anti-rabbit antibody conjugated to Cy5 (Jackson Laboratories, West Grove, 
Pennsylvania) and detected by fluorescence imaging (Typhoon instrument, 
Molecular Dynamics, Sunnyvale, CA). The Pr55 and CA were then quantified by 
30 densitometry and the amount of released VLP was then determined by calculating 
the ratio between VLP-associated CA and intracellular CA and Pr55 as previously 
fWribed rSchubert et al., J. Virol. 72:2280-88 (1998)). 
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• Analysis of reverse transcriptase activity in supernatants 
RT activity was determined in pelleted VLP (see above) by using an RT 
assay kit (Roche, Germany; Cat. No. 1468120). Briefly, VLP pellets were 
resuspended in 40 pi RT assay lysis buffer and incubated at room temperature for 30 

5 minutes. At the end of incubation 20 jil RT assay reaction mix was added to each 
sample and incubation continued at 37°C overnight. Samples (60 jil) were than 
transferred to MTP strip wells and incubated at 37°C for 1 hour. Wells were washed 
five times with wash buffer and DIG-POD added for a one-hour incubation at 37°C. 
At the end of incubation wells were washed five times with wash buffer and ABST 

10 substrate solution was added and incubated until color developed. The absorbance 
was read in an ELISA reader at 405 nm (reference wavelength 492 nm). The 
resulting signal intensity is directly proportional to RT activity; RT concentration 
was determined by plotting against a known amount of RT enzyme included in 
separate wells of the reaction. 

15 

Example 15. MSTP028 Reduction bv siRNA Decreases HIV VLP Production. 

This example demonstrates the effects of an siRNA-mediated decrease in 
MSTP028 expression on the production of HIV virus-like particles in HeLa cells. 
The effects were measured at steady state. 
20 Experiments were performed according to two different protocols. 

Experiment 1 proceeded with a second transfection on day 3, while Experiment 2 
involved an additional exchange of medium on day 3, and proceeded to the second 
transfection on day 4. The results from Experiment 1 are shown Figure 29A, and 
those for Experiment 2 are shown in Figure 29B. 

25 

Day 1 : Preparing Cells 

4.5X105 HeLa SS6 cells/well, were seeded in 1 x 6 well plates. Cells were seeded 
in transfection medium (growing medium free of antibiotics). 

30 

Materials: 

Cat. No. Manufacture Reagent Name 
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D5796 Sigma DMEM 

04-121-1 A BeitHaemek FCS 

D8537 Sigma PBS 

P4333 Sigma Pen/Strep 

T4049 Sigma 0.25% Trypsin-EDTA 



Day 2: Transfection 



Materials: 

10 Cat. No. Manufacture Reagent Name 

11668-027 Invitrogen LF2000 reagent 

31985-047 GibcoBRL OptiMEM 

MSTP028 RNAi constructs: 

15 siRNA target sequence Accession Pos. 

MST028 AAGTGCTCACCGACAGTGAAG NM__031954 197 

MST028 AAGATACTTATGAGCCTTTCT NM 031954 392 



Experimental and Control Conditions: 
20 1 - Control siRNA+ pNLEnv-1 

2- POSH siRNA + pNLenv-1 

3- MSTP028 siRNA + pNLenv-1 



1 . Two hours before transfection, replace cell media to 2ml/well complete 
25 DMEM without antibiotics. 

2. siRNA dilution: for each transfection dilute 100 nm siRNA in 0.25 ml 
OptiMEM per well. 

3. LF 2000 dilution: for each well dilute 5ul lipofectamine reagent in 0.25ml 
OptiMEM. 

30 4. Incubate diluted siRNAs and LF 2000 for 5 minutes at RT. 

5. Mix the diluted siRNAs with diluted LF2000 and incubated for 25 minutes at 
RT. 



-148- 



WO 2004/078130 



PCTAJS2004/006308 



6. Add the mixture to the cells, 0.5 ml/well (drop wise) and incubate for 24 
hours at 37°C in C0 2 incubator. 

Transfections: for each well 
5 (1 2.5 nl (siRNA)/ 0.25 ml OptiMEM) x 3 
LF 2000 35 ^1 /1.75 ml 

Day 3: 

1 0 Exp. 1 : second transfection (as Day 4 below). 
Exp. 2: Exchange medium. 

Day 4: 

15 Exp. 1 : VLP assay (see below). 
Exp. 2: Second transfection 

1 . Two hours before transfection, replace cell media to 2ml/well complete 

DMEM without antibiotics. 
20 2. siRNA and DNA dilution: Prepare dilution of plasmid pNLenv-1 0.75 \ig / 

well in 0.25 ml OptiMEM (total of 3 wells). Divide plasmid dilution to 

eppendorf tubes (0.25 ml each). To each tube add siRNA 40nM (2.5 jil). 
3. LF 2000 dilution: for each well dilute 5|il lipofectamine reagent in 0.25ml 

OptiMEM. 

25 4. Incubate diluted siRNAs and LF 2000 for 5 minutes at RT. 

5. Mix the diluted siRNAs with diluted LF2000 and incubated for 1 hour at RT. 

6. Add the mixture to the cells, 0.5 ml/well (drop wise) and incubate for 24 
hours at 37°C in C0 2 incubator. 

30 Day 5: 

Exp. 2: VLP assay. 
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Solutions: 



Lysis buffer 
Tris-HClpH7.6 
5 MgC12 



50mM 



1.5mM 



NaCl 



150mM 



Glycerol 



10% 



NP-40 



0.5% 



DOC 



0.5% 



10 



EDTA 



ImM 



EGTA 



ImM 



Add PI 3 C 1:200. 

Steady state VLP assay 
15 A. Cell extracts 

1 . Pellet floating cells by centrifugation (lmin, 14000rpm at 40C), save 

supernatant (continue with supernatant immediately to step B), scrape cells 
in ice-cold PBS, add to the corresponding floated cell pellet and centrifuge 
for 5min 1800rpm at 40C. 
20 2. Wash cell pellet once with ice-cold PBS. 

3. Resuspend cell pellet (from 6 well) in 100 julI NP40-DOC lysis buffer and 
incubate 10 minutes on ice. 

4. Centrifuge at 14,000rpm for 1 5min. Transfer supernatant to a clean 
eppendorf. 

25 5. Prepare samples for SDS-PAGE by adding them sample buffer and boil for 
lOmin - take the same volume for each reaction (15 jil). 

B. Purification of VLP from cell media 
1 . Filtrate the supernatant through a 0A5\xm filter. 
30 2. Centrifuge supernatant at 14,000rpm at 40C for at least 2h. 

3. Resuspend VLP pellet in 50 |il IX sample buffer and boil for 10 min. Load 



25 pi of each sample. 
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C. Western Blot analysis 

1 . Run all samples from stages A and B on Tris-Gly SDS-PAGE 12.5%. 

2. Transfer samples to nitrocellulose membrane (1 00V for 1 . 1 5h.). 
5 3. Dye membrane with ponceau solution. 

4. Block with 10% low fat milk in TBS-t for lh. 

5. Incubate with anti p24 rabbit 1 :500 in TBS-t 2 hour (room temperature) - 
overnight (40C). 

6. Wash 3 times with TBS-t for 7min each wash. 

10 7. Incubate with secondary antibody anti rabbit cy5 1 :500 for 30min. 

8. Wash five times for lOmin in TBS-t 

9. View in Typhoon for fluorescence signal (650). 



Example 16. POSH-depleted cells have lower levels of Hern and it is not 
15 monoubiquitinated 

POSH-depleted cells and their control counterparts were lysed and 
immunoblotted with anti-herp antibodies. Cells depleted of POSH (HI 53 RNAi 
stables cell lines) cells have lower levels of Herp compared with control cells (HI 87 
RNAi) (Figure 34A panel A). When cells were trasnfected with a plasmid encoding 

20 flagged-tagged ubiquitin, and immunoprecipitated with anti-flag antibodies to 

immunoprecipitate ubiquitinated proteins, Herp was ubiquitinated only in HI 87 cells 
and not in HI 53 cells (Figure 34A panel B). When the aforementioned cells were 
transfected with Herp-encoding plasmid, exogenous herp levels were also reduced in 
HI 53 cells compared to HI 87 cells (Figure 34B panel A) and the ubiquitination of 

25 exogenous herp was reduced in the former cells, similar to endogenous Herp. The 
molecular weight of ubiquitinated Herp is as predicated to full-length Herp and does 
not seem as a high molecular weight smear, a characteristic of polyubiquitinated 
proteins. Thus POSH is responsible for the mono-ubiquitination of Herp, and in the 
absence of this modification herp is subjected to degradation, which may be 

30 mediated by the proteosome. 

Materials and methods 
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Plasmid generation 

Full-length Heip was cloned from image clone MGC:45131 IMAGE:5575914 
(GeneBank Accesion BC032673) into pCMV-SPORT6. 

5 

Antibody production 

Herpl (amino acids 1 to 251) was amplified from a plasmid (3Gd4) obtained by 
yeast two hybrid screen for interactors of POSH. The amplified open reading frame 
was cloned into pGEX-6P, expressed in E. coli BL21 by induction with 1 mM IPTG 
10 and purified on glutathione-agarose. The purified protein was cleaved with 
Precision™ protease (Amersham Biosciences) and the GST moiety removed by 
glutathione chromatography. The protein was injected into rabbits (Washington 
Biotechnology) to produce anti-Herpl sera. 

1 5 Transfections and antibody detection 

Twenty-four hours prior to transfection POSH-RNAi clones (HI 53) or control- 
RNAi clones (HI 87) cells were plated in 10 cm dishes in growth medium (DMEM 
containing 10% fetal calf serum without antibiotics). Cells were transfected with 
lipofectamin 2000 (Invitrogen Corporation) and either Herp-expression plasmid (2.5 

20 |xg) or empty vector (2.5 jxg) and a vector encoding Flag-tagged ubiquitin (1 |J.g). 
Twenty-four hours post-trasnfection cells were lysed in lysis buffer (50 mM Tris- 
HC1, pH7.6, 1.5 mM MgC12, 150 mM NaCL, 10% glycerol, 1 mM EDTA, 1 mM 
EGTA, 0.5% NP-40 and 0.5% sodium deoxycholate, containing protease inhibitors) 
and subjected to immunoprecipitation with anti-Flag antibodies (Sigma, F7425) to 

25 precipitate ubiquitinated proteins. Immunoprecipitated material and total cell lysates 
were separated on 10% SDS-PAGE and transferred to nitrocellulose membranes 
which were immunoblotted with anti-Herp antibodies. 

Generation of HI 87 and H153 cell lines 
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To relieve the necessity for multiple transfections and to improve the reproducibility 
of hPOSH reduction, we have generated two cell lines, H187 and H153 
constitutively expressing an integrated control and hPOSH siRNA (respectively). 

Construction of shRNA retroviral vectors- hPOSH scrambled oligonucleotide (5'- 
5 CACACACTGCCGTCAACTGTTCAAGAGACAGTTGACGGCAGTGTGTGTTT 
TTT-3 > ; and 5 ' - AATTAAAAAAC AC AC ACTGCCGTCAACTGTCTCTTGAAC A 
GTTGACGGCAGTGTGTGGGCC- 3') were annealed and cloned into the Apal- 
EcoRI digested pSilencer 1.0-U6 (Ambion, Inc.) to generate pSIL-scrambled. 
Subsequently, the U6-promoter and RNAi sequences were digested with BamHI, 
10 and blunted by end filling. The insert was cloned into the Olil site in the retroviral 
vector, pMSCVhyg (BD Biosciences Clontech), generating pMSCVhyg-U6- 
scrambled. The hPOSH oligonucleotide encoding RNAi against hPOSH 

(5'-AACAGAGGCCTTGGAAACCTGGAAGCTTGCAGGTTTCCAAGGCCTCT 
GTT-3'; and 

1 5 5 '-GATCAACAGAGGCCTTGGAAACCTGCAAGCTTCCAGGTTTCC AAGGC 
CTCTGTT-3') were annealed and cloned into the BamHI-EcoRV site of pLIT-U6, 
generating pLIT-U6 hPOSH-230. The pLIT-U6 is an shRNA vector containing the 
human U6 promoter (amplified by PCR from human genomic DNA with the 
primers, 5 ' -GGCCC ACTAGTC AAGGTCGGGCAGGAAGA-3 ' and 

20 5 ' -GCCGAATTC AAAAAGGATCCGGCGAT ATCCGGTGTTTCGTCCTTTCC A- 
3') cloned into pLITMUS38 (New England Biolabs, Inc.) digested with Spel-EcoRL 
Subsequently, the U6 promoter-hPOSH shRNA (pLIT-U6 hPOSH-230 digested 
with SnaBI and Pvul) was cloned into the Olil site of pMSCVhyg (BD Biosciences 
Clontech) generating pMSCVhyg U6-hPOSH-230. 
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Recombinant retrovirus production- HEK 293T cells were transfected with 
retroviral RNAi plasmids (pMSCVhyg-U6-POSH-230 and pMSCVhyg-U6- 
scrambled and with plasmids encoding VSV-G and Moloney Gag-pol. Two days 
post-transfection, the retrovirus-containing medium was collected and filtered. 

5 Infection and selection- Polybrene (Hexadimethrine bromide) (Sigma) (8|xg/ml) 
was added to the filtered and the treated medium was subsequently used to infect 
HeLa SS6 cells. Forty-eight hours post-infection clones were selected for RNAi 
expression by the addition of hygromycin (300 ng/ml). Clones expressing the 
scrambled and the hPOSH RNAi were termed HI 87 and H153 (respectively). 

10 Example 17. Inhibition of HBV production 

HepG2.2.15 cells were plated on 9cm dishes and allowed to grow in 8% FCS 
for 5 days up to 70% confluence. After 5 days, cells were washed twice with PBS 
and re-supplied with fresh DMEM without FCS. In this medium, cells were treated 
every 24 hours with the depicted solutions (3jll1 solution/lml medium) for another 4 
15 days (4 treatments total). After 4 days, medium was collected from each plate, 
viruses were sedimented and analyzed. 

As shown in Figure 35, lanes 7 and 8, compounds CAS number 14567-55-4 
and CAS number 414908-38-0 inhibit HBV production at a concentration of 3pM. 
Detection of HBV proteins was performed essentially as described in Paran, N et al 
20 (2001) EMBO J 20(16):4443-4453. 

INCORPORATION BY REFERENCE 

All publications and patents mentioned herein are hereby incorporated by 
reference in their entirety as if each individual publication or patent was specifically 
25 and individually indicated to be incorporated by reference. In case of conflict, the 
present application, including any definitions herein, will control. 
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EQUIVALENTS 

While specific embodiments of the subject applications have been discussed, 
the above specification is illustrative and not restrictive. Many variations of the 
5 applications will become apparent to those skilled in the art upon review of this 
specification and the claims below. T he full scope of the applications should be 
determined by reference to the claims, along with their full scope of equivalents, and 
the specification, along with such variations. 

10 
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What Is Claimed: 

1 . An isolated, purified or recombinant complex comprising a POSH 
polypeptide and a POSH-associated protein (POSH-AP). 

2. The complex of claim 1, wherein the POSH-AP comprises a polypeptide 
5 selected from the group consisting of: PKA, SNX1, SNX3, ATP6V0C, 

PTPN12, PPP1CA, GOSR2, CENTB1, DDEF1, ARF1, ARF5, PACS-1, 
EPS8L2, HERPUD1, UNC84B, MSTP028, GOCAP, EIF3S3, SRA1, CBL- 
B, RALA, SIAH1, SMN1, SMN2, SYNE1, TTC3, VCY2IP1 and UBE2N 
(UBC13). 

10 3 . The complex of claim 1 , wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: ARHV (Chp), WASF1, HIP55, 
SPG20, HLA-A, arid HLA-B. 

4. The complex of any one of claims 1 -3 , wherein the POSH polypeptide is a 
human POSH polypeptide. 

15 5 . An isolated, purified or recombinant complex comprising HERPUD 1 and a 
Ubiquitin ligase. 

6. The complex of claim 5, wherein the Ubiquitin ligase is selected from the 
group consisting of: POSH, CBL-B, TTC3, and SIAH1. 

7. A method for identifying an agent that modulates an activity of a POSH 
20 polypeptide or POSH-AP, the method comprising identifying an agent that 

disrupts a complex of any one of claims 1-3, wherein an agent that disrupts a 
complex of any of claims 1-3 is an agent that modulates an activity of the 
POSH polypeptide or the POSH-AP. 

8. A method of identifying an antiviral agent, comprising: 

25 (a) identifying a test agent that disrupts a complex comprising a POSH 

polypeptide and a POSH-AP; and 
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(b) evaluating the effect of the test agent on a function of a virus, 

wherein an agent that inhibits a pro-infective or pro-replicative function of a 
virus is an antiviral agent. 

9. The method of claim 8, wherein the POSH-AP is selected from the group 
5 consisting of: PKA, SNX1, SNX3, PTPN12, GOSR2, CENTB1, ARF1, 

ARF5, PACS-1, EPS8L2, HERPUD1, SMN1, SMN2, UNC84B, MSTP028, 
GOCAP, CBL-B, SYNE1, UBE2N (UBC13), SIAH1, TTC3, WASF1, 
HIP55, RALA, and SPG20. 

10. The method of claim 8, wherein the virus is an envelope virus. 

10 11. The method of claim 8, wherein the virus is a Human Immunodeficiency 
Virus. 

12. The method of claim 8, wherein the virus is a West Nile Virus. 

13. The method of claim 8, wherein the virus is Moloney Murine Leukemia 
Virus (MMuLV). 

15 14. The method of claim 8, wherein evaluating the effect of the test agent on a 
function of the virus comprises evaluating the effect of the test agent on the 
budding or release of the virus or a virus-like particle. 

15. A method of identifying an anti-apoptotic agent, comprising: 

(a) identifying a test agent that disrupts a complex comprising a POSH 
20 polypeptide and a POSH-AP; and 

(b) evaluating the effect of the test agent on apoptosis of a cell, 

wherein an agent that decreases apoptosis of the cell is an anti-apoptotic 
agent. 

16. A method of identifying an anti-cancer agent, comprising: 
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(a) identifying a test agent that disrupts a complex comprising a POSH 
polypeptide and a POSH-AP; and 

(b) evaluating the effect of the test agent on proliferation or survival of a 
cancer cell, 

5 wherein an agent that decreases proliferation or survival of a cancer cell is an 

anti-cancer cell. 

17. The method of claim 16, wherein the POSH-AP is selected from the group 
consisting of: PKA, SNX1, PTPN12, PPP1CA, ARF1, ARF5, CENTB1, 
EPS8L2, EIF3S3, CBL-B, RALA, S1AH1, TTC3, ATP6V0C, and VCY2IP1. 

10 18. The method of claim 16, wherein the cancer cell is a cell derived from a 
POSH-associated cancer. 

19. A method of identifying an agent that inhibits trafficking of a protein through 
the secretory pathway, comprising: 

(a) identifying a test agent that disrupts a complex comprising a POSH 
1 5 polypeptide and a POSH-AP; and 

(b) evaluating the effect of the test agent on the trafficking of a protein 
through the secretory pathway 

wherein an agent that disrupts localization of said POSH-AP is an agent that 
inhibits trafficking of a protein through the secretory pathway. 

20 20. The method of claim 19, wherein step (b) comprises evaluating the effect of 
the test agent on the trafficking of a myristoylated protein through the 
secretory pathway. 

21. The method of claim 19, wherein step (b) comprises evaluating the effect of 
the test agent on the trafficking of a viral protein through the secretory 
25 pathway. 
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22. The method of claim 19, wherein (b) comprises evaluating the effect of the 
test agent on the trafficking of a protein associated with a neurological 
disorder through the secretory pathway. 

23. The method of claim 22, wherein the protein associated with a neurological 
5 disorder is amyloid beta precursor protein. 

24. A method of identifying an agent that inhibits the progression of a 
neurological disorder, comprising: 

(a) identifying a test agent that disrupts a complex comprising a POSH 
polypeptide and a POSH-AP; and 

10 (b) evaluating the effect of the test agent on the trafficking of a protein 

through the secretory pathway 

wherein an agent that disrupts localization of a POSH-AP is an agent that 
inhibits progression of a neurological disorder. 

25. The method of claim 24, wherein the POSH-AP is selected from the group 
1 5 consisting of: HERPUD1 , CBL-B, SIAH1 , and TTC3. 

26. The method of claim 25, wherein the POSH-AP is HERPUD1 . 

27. A method of identifying an agent that inhibits the progression of a 
neurological disorder, comprising: 

(a) identifying a test agent that disrupts a complex comprising a POSH 
20 polypeptide and a POSH-AP; and 

(b) evaluating the effect of the test agent on the ubiquitination of a 
protein. 

28. The method of claim 27, wherein the POSH-AP is HERPUD1. 
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29. A method of treating a viral infection in a subject in need thereof, comprising 
administering an agent that inhibits a POSH-AP in an amount sufficient to 
inhibit the viral infection. 

30. The method of claim 29, wherein the agent is selected from the group 
5 consisting of: 

i) an agent that inhibits a kinase activity of the POSH-AP; 

ii) an agent that inhibits expression of the POSH-AP; 

iii) an agent that inhibits the ubiquitin ligase activity of the POSH-AP; 

iv) an agent that inhibits the phosphatase activity of the POSH-AP; 
10 v) an agent that inhibits the GTPase activity of the POSH-AP; and 

vi) an agent that inhibits the ubiquitination of the POSH-AP. 

3 1 . The method of claim 29, wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PKA, SNX1, SNX3, SMN1, SMN2, 
PTPN12, GOSR2, CENTB1, ARF1, ARF5, PACS-1, EPS8L2, HERPUD1, 

15 UNC84B, MSTP028, GOCAP, CBL-B, SYNE1, UBE2N (UBC13), SIAH1, 

TTC3, WASF1, HIP55, RALA, and SPG20. 

32. The method of claim 31, wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PKA, HERPUD1, MSTP028, CBL-B, 
andUBE2N(UBC13). 

20 33. The method of claim 32, wherein said agent is selected from the group 

consisting of: an siKNA construct, a small molecule, an antibody, and an 
antisense construct. 

34. The method of claim 33, wherein the agent is an siRNA construct comprising 
a nucleic acid sequence that hybridizes to an mRNA encoding the POSH-AP. 
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35. 

5 36. 
37. 
38. 

10 

39. 

15 



The method of claim 34, wherein the agent is an siRNA construct or an 
antisense construct that inhibits the expression of a polypeptide selected from 
the group consisting of PKA, HERPUD1, MSTP028, CBL-B, and UBE2N 
(UBC13). 

The method of claim 35, wherein the agent is an siRNA construct or an 
antisense construct that inhibits the expression of HERPUD1 or MSTP028. 

The method of claim 36, wherein the siRNA construct inhibits the expression 
ofMSTP028. 

The method of claim 36, wherein the siRNA construct inhibits the expression 
of HERPUD1 and is selected from the group consisting of: 5'- 
GGGAAGUUCUUCGGAACCUdTdT-3 ' and 5'- 
dTdTCCCUUC AAGAAGCCUUGGA-5 ' . 

The method of claim 33, wherein the small molecule inhibitor is selected 
from among the following categories: adenosine cyclic 
monophosphorothioate, isoquinolinesulfonamide, piperazine, piceatannol, 
and ellagic acid. 

The method of claim 33, wherein the small molecule is selected from among: 




-161- 



WO 2004/078130 



PCTYUS2004/006308 





41 . The method of claim 23, wherein the small molecule inhibits the 
ubiquitination of a POSH-AP. 

10 42. The method claim 29, wherein the subject is infected with an envelope virus. 

43. The method of claim 42, wherein the envelope virus is an HIV. 

44. The method of claim 42, wherein the envelope virus is a WNV. 

45. The method of claim 29, wherein the virus is a MMuLV. 
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46. Use of a protein kinase A inhibitor for the manufacture of a medicament for 
treatment of a viral infection. 

47. Use of an inhibitor of HERPUD 1 for the manufacture of a medicament for 
treatment of a viral infection. 

5 48. Use of an inhibitor of MSTP028 for the manufacture of a medicament for 
treatment of a viral infection. 

49. A packaged pharmaceutical for use in treating a viral infection, comprising: 

(a) a pharmaceutical composition comprising an inhibitor of a POSH- AP 
and a pharmaceutical^ acceptable carrier; and 

10 (b) instructions for use. 

50. The packaged pharmaceutical of claim 49, wherein the viral infection is 
caused by an envelope virus. 

51. A method for identifying an antiviral agent comprising: 

(a) identifying a test agent that inhibits an activity of or expression of a 
15 POSH-AP; and 

(b) evaluating an effect of the test agent on a function of a virus. 

52. A method of evaluating an antiviral agent comprising: 

(a) providing a test agent that inhibits an activity of or expression of a 
POSH-AP; and 

20 (b) evaluating an effect of the test agent on a function of a virus. 

53. The method of claim 5 1 or 52, wherein the virus is an envelope virus. 

54. The method of claim 51 or 52, wherein the virus is a Human 
Immunodeficiency Virus. 
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55. The method of claim 51 or 52, wherein the virus is a West Nile Virus. 

56. The method of claim 5 1 or 52, wherein the virus is a MMuLV. 

57. The method of claim 51 or 52, wherein evaluating the effect of the test agent 
on a function of the virus comprises evaluating the effect of the test agent on 
the budding or release of the virus or a virus-like particle. 

58. The method of claim 5 1 or 52, wherein the POSH-AP is selected from the 
group consisting of: PKA, SNX1, SNX3, PTPN12, GOSR2, SMN1, SMN2, 
CENTB1, ARF1, ARF5, PACS-1, EPS8L2, HERPUD1, UNC84B, 
MSTP028, GOCAP, CBL-B, SYNE1, UBE2N (UBC13), SIAH1, TTC3, 
WASF1, H1P55, RALA, and SPG20. 

59. The method of claim 58, wherein the POSH-AP is HERPUD1. 

60. The method of claim 58, wherein the POSH-AP is MSTP028. 

61 . The method of claim 51 or 52, wherein the test agent is selected from among: 
an antisense nucleic acid, an siRNA construct, a small molecule, an antibody 
and a polypeptide. 

62. The method of claim 61, wherein the siRNA construct inhibits the expression 
of HERPUD1 and is selected from the group consisting of: 5'- 
GGGAAGUUCUUCGGAACCUdTdT-3 * and 5'- 

dTdTCCCUUC AAGAAGCCUUGGA-5 ' . 

63 . A method of identifying an agent that modulates a POSH function, 
comprising: 

a) identifying an agent that modulates a POSH-AP; and 

b) testing the effect of the agent on a POSH function. 

64. A method of evaluating an agent that modulates a POSH function, 
comprising: 
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a) providing an agent that modulates a POSH-AP; and 

b) testing the effect of the agent on a POSH function. 

65. The method of claim 64 or 65, wherein the POSH-AP comprises a 

polypeptide selected from the group consisting of : PKA, SNX1, SNX3, 
5 ATP6V0C, PTPN12, PPP1CA, GOSR2, CENTB1, DDEF1, ARF1, ARF5, 

PACS-1, EPS8L2, HERPUD1, UNC84B, MSTP028, GOCAP, EIF3S3, 
SRA1, CBL-B, RALA, SIAH1, SMN1, SMN2, SYNE1, TTC3, VCY2IP1 
andUBE2N(UBC13). 

The method of claim 64 or 65, wherein the POSH-AP comprises a 
polypeptide selected from the group consisting of: ARHV (Chp), WASF1, 
H1P55, SPG20, HLA-A, and HLA-B. 

The method of claim 64 or 65, wherein testing the effect of the agent on a 
POSH function comprises testing the effect of the agent on the production of 
viral particles or virus like particles in a cell infected with an envelope virus. 

The method of claim 64 or 65, wherein testing the effect of the agent on a 
POSH function comprises testing the effect of the agent on a POSH 
enzymatic activity. 

The method of claim 68, wherein the POSH enzymatic activity is ubiquitin 
ligase activity. 

The method of claim 64 or 65, wherein testing the effect of the agent on a 
POSH function comprises testing the effect of the agent on POSH-mediated 
localization or secretion of a protein. 

71 . The method of claim 64 or 65, wherein testing the effect of the agent on a 
POSH function comprises testing the effect of the agent on the interaction of 

25 POSH with a POSH-AP. 

72. The method of claim 71, wherein the POSH-AP is a small GTPase. 



66. 

10 

67. 

15 68. 
69. 

20 70. 
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73. The method of claim 72, wherein the small GTPase is selected from the 
group consisting of: ARF1, ARF5, and RALA. 

74. The method of claim 64 or 65, wherein the test agent is selected from among: 
an antisense nucleic acid, an siRNA construct, a small molecule, an antibody 

5 and a polypeptide. 

75 . A method of identifying an agent that modulates a HERPUD 1 function, 
comprising: 

a) identifying an agent that modulates POSH; and 

b) testing the effect of the agent on a HERPUD 1 function. 

10 76. A method of evaluating an agent that modulates an HERPUD1 function, 
comprising: 

a) providing an agent that modulates POSH; and 

b) testing the effect of the agent on a HERPUD 1 function. 

77. The method of claim 75 or 76, wherein testing the effect of the agent on a 
15 HERPUD 1 function comprises contacting a cell with the agent and 

measuring the effect of the agent on ubiquitination of HERPUD 1 in the cell. 



78. A method of treating a viral infection in a subject in need thereof, comprising 
administering an agent that inhibits MSTP028 in an amount sufficient to 

20 inhibit viral infection. 

79. The method of claim 78, wherein said agent is selected from the group 
consisting of: an siRNA construct, a small molecule, an antibody, and an 
antisense construct. 

80. The method of claim 79, wherein the agent is an siRNA construct comprising 
25 a nucleic acid sequence that hybridizes to an inRNA encoding the MSTP028. 
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81. A method of inhibiting an activity of a POSH-AP in a cell, comprising 
contacting the cell with an inhibitor of POSH. 

82. The method of claim 81 , wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PKA, SNX1, SNX3, ATP6V0C, 
PTPN12, PPP1CA, GOSR2, CENTB1, DDEF1, ARF1, ARF5, PACS-1, 
EPS8L2, HERPUD1, UNC84B, MSTP028, GOCAP, EIF3S3, SRA1, CBL- 
B, RALA, SIAH1, SMN1, SMN2, SYNE1, TTC3, VCY2IP1 and UBE2N 
(UBC13). 

83 . The method of claim 8 1 , wherein the inhibitor of POSH is selected from 
among the following: 

i) an agent that inhibits a POSH activity; and 

ii) an agent that inhibits expression of a POSH. 

84. The method of claim 83, wherein the POSH activity is ubiquitin ligase 
activity. 

85. A method of treating a POSH-associated disease in a subject, comprising 
administering a POSH-AP inhibitor to a subject in need thereof. 

86. The method of claim 85, wherein said POSH-AP inhibitor is an agent 
selected from the group consisting of: 

i) an agent that inhibits a kinase activity of the POSH-AP; 

ii) an agent that inhibits expression of the POSH-AP; 

iii) an agent that inhibits the ubiquitin ligase activity of the POSH-AP; 

iv) an agent that inhibits the phosphatase activity of the POSH-AP; 

v) an agent that inhibits the GTPase activity of the POSH-AP; and 

vi) an agent that inhibits the ubiquitination of the POSH-AP. 
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87. The method of claim 85, wherein the POSH-associated disease is a viral 
infection. 

88. The method of claim 85, wherein the POSH-associated disease is a POSH- 
associated cancer. 

5 89. The method of claim 85, wherein the POSH-associated disease is a POSH- 
associated neurological disorder. 

90. A method of identifying an anti-viral agent, comprising: 

a) forming a mixture comprising a POSH polypeptide, a POSH-AP and 
a test agent; and 

1 o b) detecting phosphorylation of the POSH polypeptide, 

wherein an agent that inhibits phosphorylation of POSH is an anti-viral 
agent. 

91 . A method of identifying an anti-viral agent, comprising: 

a) forming a mixture comprising a POSH polypeptide, a POSH-AP, 
1 5 ubiquitin and a test agent; and 



b) 



detecting ubiquitination of the POSH-AP, 



wherein an agent that inhibits ubiquitination of the POSH-AP is an anti-viral 
agent 



92. 



The method of claim 91, wherein the POSH-AP is HERPUD1. 



20 



93. 



A method of identifying a modulator of POSH, comprising: 



a) forming a mixture comprising a POSH polypeptide, a POSH-AP and 
a test agent; and 



detecting phosphorylation of the POSH polypeptide, 
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wherein an agent that alters phosphorylation of POSH is an agent that 
modulates POSH. 

94. A method of identifying a modulator of POSH, comprising: 

a) forming a mixture comprising a POSH polypeptide, a POSH-AP, 
5 ubiquitin and a test agent; and 

b) detecting ubiquitination of the POSH-AP, 

wherein an agent that inhibits ubiquitination of the POSH-AP. is an agent that 
modulates POSH. 

95. The method of claim 91, wherein the POSH-AP is HERPUD1. 

10 96. A method of treating or preventing a POSH associated cancer in a subject 
comprising administering an agent that inhibits a POSH-AP to a subject in 
need thereof, wherein said agent treats or prevents cancer. 

97. The method of claim 96, wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PKA, SNX1, PTPN12, PPP1CA, 

15 CENTB1, ARF1, ARF5, EPS8L2, EIF3S3, CBL-B, RALA, SIAH1, TTC3, 

ATPV0C, and VCY2IP1. 

98. The method of claim 96, wherein the cancer is associated with increased 
POSH expression. 

99. A method of treating or preventing a POSH-associated neurological disorder 
20 in a subject comprising administering an agent that inhibits a POSH-AP to a 

subject in need thereof, wherein said agent treats or prevents the neurological 
disorder. 

1 00. The method of claim 99, wherein the POSH-AP comprises a polypeptide 
selected from the group consisting of: PTPN12, DDEF1, EPS8L2, 

25 HERPUD1, GOCAP, CBL-B, SIAH1, SMN1, SMN2, TTC3, SPG20, SNX1, 

and ARF1. 
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101. A method of treating a neurological disorder comprising administering an 
agent to a subject in need thereof, wherein said agent, inhibits the ubiquitin 
ligase activity of POSH. 

102. A method of treating a neurological disorder comprising administering an 
5 agent to a subject in need thereof, wherein said agent inhibits the 

ubquitination of a POSH-AP. 

103. The method of claim 101 or claim 102, wherein the neurological disorder is 
selected from among: Alzheimer's disease, Parkinson's disease, 
Huntington's disease, schizophrenia, Niemann-Pick's disease, and prion- 

10 associated diseases. 

104. The use of an agent of claim 103, wherein the neurological disorder is 
Alzheimer's disease. 

105. The method of claim 101 or claim 102, wherein said agent is selected from 
the group consisting of: an siRNA construct, a small molecule, an antibody, 

1 5 and an antisense construct. 

106. The method of claim 105, wherein the small molecule is selected from 




CI 



among: 
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107. The method of claim 102, wherein the POSH-AP is HERPUD1 . 

10 108. The method of claim 6 1 , wherein the siRNA construct inhibits the 

expression of MSTP028 and is selected from the group consisting of: 5'- 
AAGTGCTC ACCGAC AGTG AAG-3 ' and 5'- 
AAGATACTTATGAGCCTTTCT-3 ' . 



15 
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Figure 1 : Human POSH Coding Sequence (SEQ JD-NGd^art 1) 

ATGGATGAAT CAG C CTTGTTGGATCTTTTGGAGTGT CC<jGTGTGT CTAGAGCGCCTTGATG CTTCTGCGA 
AGGTCTTGCCTTGCCAGCATAOGTTTTGCAAGCGATGCT 

CAGATGT C CCGAGTG CAGGACTCTTGTTGG CT CGGGTGT CGAGGAGCTTC C CAGTAACATC TTGCTGGTC 
AGACTTCTGGATGGCATCAAAC^GAGGCCTTGGAAACCTGGTCCTGGTGGG 

CAAATG CATT AAGGT CT CAGAGCAGCACTGTGG CT AATTGTAGCT CAAAAGAT CTGCAG AG CT CCCAGGG 
CGGACAG CAG C CT CGGGTGCAATCCTGGAG C CC C CCAGTGAGGGGT ATAC CT CAGTTACCATGTG C CAAA 
GCGTTATACAACTATGAAGGAAAAGAGCCTGGAGACCT^ 

GAAGACAAGTGGATG AAAATTGGTAC CATGGGGAAGTCAATGGAAT C QATGGCTTTTT C C C CACCAACTT 
TGTGCAGATTATTAAACCGTTACCTCAGC CCCCACCT CAGTGCAAAGC^CTTTATGACTTTGAAGTGAAA 
GACAAGGAAG (^GACAAAGATTG CCTT C CATTTG CAAAGGATGAT GTTCTGACTGTGAT C CGAAGAGTGG 
ATGAAAACTGGGC TG AAGGAATG CTGGCAGACAAAATAGGAATATTT CCAATTT CATATGT TGAGTTTAA 
CTCGGOTGCTAAGCAGCTGATAGAATGGGATAAGCCTCCTCTC 

T CGGCAGCAG CC CAGAGCAGCACTG C C CCAAAGCACT C CGACAC CAAGAAGAACAC CAAAAAG CGG CACT 

CCTT(^CTTCCCTCACTATGGCCAAC^GTCCrCC(^GGC^TCCCAGAA^ 

CCCCCCTGTCCTCATCAGCTCCAGCAACCCCACTGCT^ 

TG CAGTG C C CCTT CT CAGGTT CATATAAGTAC CAC CGGGTTAATTGTGAC CCCGC C C C CAAG CAG C C CAG 
TGACAACIK^CCCCT CX^TTACTTTCCCATCAGATGTTC CCTAC CAAGCTGCCCTTGGAACTTTGAATCC 
TCCTCTTCC^CCACCCCCTCTCCrrGGCKSCCACTGTCCTTGCCTCC^ 
GCTGCTGCTGCTGGAATGGGACCGAGGCCCATGGCAGGATCC^ 

AGACT CGC C C CAGTGTG T ATGTTG CTATAT AT C CATACACT C CTC<^AAAGAGGATGAACTAGAGCTGAG 
AAAAGGGGAGATGTTTTT AGTGTTTGAGCGCTGC CAGGATGG CTGGT T CAAAGGGACATC CATGCATAC C 
AGCAAGATAGGGGTTTT C C CTGG CAATTATGTGG CAC CAGT CACAAGGGCGGTGACAAATG CTTC CCAAG 
CTAAAGTCCCTATGT CTACAGCTGGCCAGACAAGTCGGGGAGTGACCATGGT CAGTCCTTC CACGGCAGG 
AGGG C CTGC C CAGAAGCT C CAGGGAAATGGCGTGG CTGGG AGT CC CAGTGTTGT C CCCGCAGCTGTGGTA 
T CAGCAG CT CACAT C CAG ACAAGT CCT CAGGCT AAGGT CT TGTTG CA CATGACGGGG CAAATGACAGT CA 
ACCAGGCCCGCAATGCTGTGAGGACAGTTGCAGCGCACAACCAGGAACGCCC 
CATCCAGGTACAGAATGCCGCO^CCTC^GCCCTGCATCTGTGGGCCTC^ 
CCACAACCTGCGCCTCTGATGCCAGGCTCAGCCACGCACACTGCTGCCATCAGTA 
CCCCTCTGGCCTGTGCAGCAGCTGCTCCACTGACT 

GCCCAGTGGCCX^ATAGTGACCGTTCTCCCTGGACTCCCCACATCTCCrGACAGTGCTTCA 
GGGAACAGTT CAG CAAC CAAAC CAGACAAGGATAGCAAAAAAGAAAAAAAGGG TTTGTTGAAGTTGCTTT 
CTGGCG C CTCCACTAAACGGAAG CCC CGCGTGTCTCCTCCAG CATCGCCCACCCTAGAAGTGGAGCTGGG 
CAGTGCAGAGCTTCCTCTCCAGGGAGCGGTGGGGCCCGAACTGCCACCAGGAGGT 
GGCTCCTGCCCTGTGGACGGGGACGGACCGGTCACGACTGCAGTGGCAGGAGCAGCCC 
CTTTT CATAGGAAGG CAAGTT CC CTGGAC T C CG CAGTTC C CATCG CT C CAC CT C CT CGC CAGG C CTGTT C 
CTCCCTGGGTCCTGTCTTGAATGAGTCTAGACCTGTCGTTTGTGAAAGGCACAGGGTGGTGGTTTCCTAT 
CCTCCTCAGAGTGAGGCAGAACTTGAACTTAAAGAAGGAGATATTGTGTTTGTTCATAAAAAACGAGAGG 
' ATGG C TGGTT CAAAGGCACAT TACAA CGTAATGGGAAAACTGGC CTTT T C C CAGGAAGCTTTGTGG AAAA 
CATATGA 
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Figure 2: Human POSH Amino Acid Sequence (SEQ ID NO:2) (part 2) 



MDESALLDLLECPVCIjERIiDASAKVIj^ 
RLIJXSIKQRPWKPGPGGGSGTNCTNAIJ^ 

ALYNYBGKEPGDIjKFSKGDII IIiRRQVDENWYHGEVNGIHGFFPTNFVQI IKPIiPQPPPQCKALYDFEVK 
DKEADKDCLP FAKDDVLTVZRRVDENWAEGMLADKIG I FP I S YVE FNSAAKQli IEWDKPPVPGVEAGE CS 
SAAAQS STAP KHSDTKKNTKKRHSFTSLTMANKS SQASQNRHSME ISP PVL I SSSNPTAAARI SKLSGLS 
CSAPSQVHISTTGIirVTPPPSSPVTTGPSFTFPSDVPYQAAIiGTLNP 
AAAAGMGPRPMAGSTDQIAHLRPQTRPSVYVAIYPYTPRKEDELEL^ 
SKIGVFPGNYVAPVTRAVTNA^QAKTOMSTAGQTSRGVTMVSPSTAG^ 
SAAHI QTSPQAKVLLHMTGQMTVNQARNAVRWAA^ 
PQPAPLMPGSATHTAAISISRASAPLAOU^AAPI^ 

GNSSATKPDKDSKiCEKKGIiliKIiliSGASTKIlKPRVSPPASPTIiEVEIjGSAE 

GS CPVDGDG P VTTAVAGAALAQDAFHRKAS SLDSAVP ZAP P P RQACS SLGP VIiNE SRP WCERHRVWS Y 
PPQSEAEIiELKEGDIVFVHKKREDGWFKGTLQRNGKTGIiFPGSFVENI 
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Figure 3: Human POSH cDNA Sequence (SEQ ID NO:3) 

CTGAGAGACA.CTGCGAG CGG CGAGCG CGGTGGGGCCX3 CATCTGCAT CAGC CGC CG CAG CCG CTGCGGGGC 
CG CGAACAAAGAGGAGGAGCCGAGGCGCGAGAG CAAAGT CTGAAATGGATGTTACATGAGT CATTTTAAG 
GGATGCACACAACTATGAACATTT CTGAAGATTTTTT CT CAGTAAAGT AGATAAAGATGGATGAAT CAGC 
CTTGTTGGATCTTTTGGAGTGTCCGGTGTGTCTAGAGCGCCT^ 
CAGCATACGTTTxXXIAAGCXIArGxxTGCT 

GCAGGACTCT TGTTGGCT CGGGTGT CGAGGAGCTTC C CAGT AACAT C TTG CTGGT CAGACTT CTGGATGG 
CATCAAACAGAGG CCTTGGAAAC CTGGTC CTGGTGGGGGAAGTGGGAC CAACTG CACAAATGCATT AAGG 
TCTCAGAGCAG CACTGTGGCTAATTGTAGCT CAAAAGAT CTG CAGAG CTC C CAGGG CGGACAG CAG CCTC 
GGGTG CAATC CTGGAGCC CCC CAGTCAGGGGT AT ACCT CAGTT AC CATGTGC CAAAGCGTTATACAACTA 
TGAAGGAAAAGAGCCTGGAGACCTTAAATTCAGCAAAGGCGACAT C^TCATTTTGCGAAGACAAGTGGAT 
GAAAATTGGT AC CATGGGGAAGT CAATGGAATC CATGGCTTTT T C CC CAC CAACTTTGTGCAGATTATT A 
AACCGTTACCTCAGCCCCCACOTCAGTGCAAAGCACTTTATGACT 

CAAAG AT TG C CTT C CATTTGCAAAGGATG ATGTT CTGAC TGTG AT C CGAAGAGTGGATGAAAACTGGGCT 
GAAGGAATGCTGG CAGACAAAATAGGAAT ATTT C CAATTT CATATGTTGAGTTTAACT CGG CTG CTAAGC 
AG CTGATAGAATGGGATAAG CCT CCTGTG C CAGG AGTTGATG C TGGAGAATGTTC CTCGG CAGCAGCCCA 
GAGCAGCACTGCC CCAAAGCACT CCGACACCAAGAAGA 

ACTATGGCCAACAAGTCCTCCCAGGCATCC CAGAACCGC CACTCCATGGAGAT CAGCC CCCCTGTCCTCA 
TCAGCTCCAGCAACCCCACTGCTGOTGCACGGATCAGC^ 

TCAGGTT CAT AT AAGTAC CAC CGGGTTAATTGTGAC CCCGC C C CCAAGCAGCC CAGTGACAACTGGCC C C 
TCGTTTACTTT CCCATCAGATGTTCC CTACCAAGCTGCC CTTGGAACTTTGAATC CTC CTCTTCCACCAC 
CCCCTCTCCTGGCTGCCACIGTCCTTGCCT 

AATGGGA C CGAGG C C CATGG CAGGAT C CACTGAC CAGAT TG CACATTTACGG C CG CAGACT CG C C C CAGT 
GTGTATGTTG CT AT AT AT C CATACAC TCCT CGGAAAG AGGATGAACTAGAG CTGAGAAAAGGGGAGATGT 
T TTTAGTGTTTGAG CGCTG C CAGGATGGC TGGTT CAAAGGGACAT C CATG CATAC CAGCAAGATAGGGGT 
T TT C C CTGGCAAT TATGTGG CAC CAGT CACAAGGG CGGTGACAAATG CTT C C CAAGCTAAAGT C C CTATG 
T CT ACAG CTGG C C AG ACAAGT CGGGGAGTG AC CATGGTC AGT C CTT C CACGGCAGGAG GGC CTG C C CAGA 
AGCTC CAGGGAAATGGCX3TGGCTGGGAGTCC CAGTGTTGTCCCCG CAGCTGTGGTATCAGCAGCTCACAT 
CCAGACAAGT C CT CAGG CTAAGGT CT TGT TG CACATGACGGGGCAAATGACAGT CAAC CAGGC CCGCAAT 
GCTGTGAGGACAGTTGCAGCGCACAACCAGteAACGCCCC^ 

ATGCCGCCGGCCTCAGCCCTGC^TCTGTGGGCCTGTCCCATC^CTCGCTGGCCTCCCCACAACCTGCGCC 
T CTGATG C CAGG CT CAG C CACG CACACTG CTG C CAT CAGT AT CAGT CGAG CCAGTG CC C CT CTGGC CTGT 
GCAGCAG CTG C T C CACTGAC TT C C C CAAGCATCAC CAGTGCT T CT CTGGAGG CTGAGC C CAGTGGC CGGA 
TAGTGAC CGTT CT C C CTG GAC T C C C CACAT CT C CTGACAGTGCTT CAT CAG CTTGTGGGAACAGTT CAGC 
AACCAAACCAGACAAGGATAGCAAAAAAGAAAAAAAGGGTTTGTT 

AAACGGAAGC C C CGCGTG T CT C CT C CAGCAT CG CC CAC C CTAGAAGTGGAG CTGGGCAGTGCAGAGCTT C 
CTCTCCAGGGAGCGGTGGGGCCCGAACTGCC^CCAGGAGGTGGCCATGGCAGGGCAGGCTCCTGCCCTGT 
GG ACGG GGAC GGACCGGT CACG A CTG CAGTGGCAGGAGCAGC C CTGG C C CAGGATG Cx X XX CATAGGAAG 
GC^GTTCCCTGGACTCCGCAGTTCCC^TCGCTCCACCTCCTCGCCAGGCCTGTTCCTCCCTGGGTCCTG 
TCTTGAATGAGT CTAGAC CTGT CGTTTGTGAAAGGCACAGGGTGGTGGTTT C CTATCCTCCTCAGAGTGA 
GGCAGAACTTGAACTTAAAGAAGGAGATATTGTGTTTGTTCATAAAAAACGAGAGGATGGCTGGTTCAAA 
GGCACATTACAACGTAATGGGAAAACTGGCCTTTT C CCAGGAAGCTTTGTGGAAAACATATGAGGAGACT 
GA CA CTGAAG AAG CTTAAAAT CACTT'CACA.C AACAAAGT AGC ACA^ TAACAGAAAG AG CACAT 
TTGTGGACTT C CAGATGGT CAG GAGATG AG CAAAGGATT GGT ATGTGACT CTGATGCC C CAGCACAGTTA 
CCCCAGCGAGCAGAGTCAAGAAGATGTxT'GTGTGGGTTTTGTTAGTCTGGATTCGGA 
CCTTGTACTGTCTGATTTACTAGACAGAGAAACT^ 
ATTGTTTAC^GGCTTAACTAATTTATtxX3CTTTTTTAAACTt^ 

TTGGATTATGATTxTAAGAAATTATTAATTTATGAAATGATAGGTAAGGAGAAGCTGGATTATCTCCTGT 
TGAGAGCAAGAGATTCGTTxTGACATAGAGTGAATGCATTTTCCCCTCTCCTCCTCCCTGCTA 
attttggggt tatgt tttgcttctttaag atag aaat c C CAGTT CTC TAATTTGGTTTTCT TCTTTGGGA 
aacoiaacatacaaatgaatcagtatcaattagggcctgggg 

AGTTAGTGATTCCCTCTCTTTCTAGTTTGGTAGGAATCAC^ 
TGGGTTTTTAATTTTCCTAGAATGAAGTGACTG&A^ 

AATGTATTTAGAAATATATTTAGTTTTATAG<^GAAGCAGCTCAATTGTTTGGTTGGAAAGTAGGGGAAA 
TT GAAGT TGT AGT CACTGTC TGAGAATGGCTATGAAG CGT CATT T CACAT TTTAC C C C AACTGAC CTG CA 
TGCCCA.GGACACAAGTAAAACATTTGTGAGATAGTGGTGGTAAGTGATGCACTCGTC 
TATAAGAAACACTGTGAAAAGTT CATATTCATCCATTGTGATTCTTT CC^ 

GGATT C C CACAGTAATAT AGACTGTG CATGGTGTGTATATTT CATTG CGATTT CCTGT TAAGATGAGTTT 
GT ACTCAGAATTGAC CAATTCAGGAGGTGTAAAAATAAACAGTGTTCTCTTCTCTACC ^ 



-to be continued 
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Figure 3: Human POSH cDNA Sequence (SEQ ID NO:3) 

CTGACCAAGGTCTCTTCAGTGCACTCGCTCCCTCTCTGGCTAA.GGCATG 

TTAGTGAAAGTGGTCTTTTATGT CCTCCCAG CAGACAGACATCAAGGATGAGTTAACCAGGAGACTACTC 
CTGTG ACTGTGGAGCT CTGGAAGGCTTGG TGGGAGTGAATTTGC CCACACCrrTACAATTGTGGCAGGAT C 
CAGAAGAGC C?TGTCTTTTTATAT C CATT C CTTGATGT CATTGG C CT CT CC CAC CGATTTCATTACGGTG C 
CACGCAGT CATGGAT CTGGGTAGT C CGGAAAACAAAAGGAGGGAAGACAGC CTGGTAATGAATAAGATC C 
TT ACCACAGTTTTCT CATGGGAAATACAT AATAAAC C CTTTCAT CTTTTTTTT TTT C C TTT AAGAATTAA 
AA CTGGGAAATAGAAACATGAACTGAAAAGT CTTG CAATGACAAGAGGTTT CATGGTCTTAAAAAG ATAC 

TCCTGTGTGTTGAATOTAAAAAAAAAAAATACTTTACT 

AAATGAACTTTAGTTAGGAAAAAG CTGGCAT CAG CTTTCATC TGTGT AAGTTGACAC CAATGTGT CATAA 
TATTCTTTATTTTGGGAAATTAGTGTATT 

GATAA'iTr'rX-rTACCTGT CTTTT CTC CATAT TTTAAGCTATGTGATTGAAGT ACCT CTGTT CATAGTTT C 
CTGGT ATAAAGT TGGTTAAAATTT CAT CTGTT AATAGAT CATTAGGT AATATAATGTATGGGTTTT CTAT 
TGGTTTTTTGCAGACAGTAGAGGGAGATTTTGTAACAAGGGCTTG 

AATTG CAATT TAT CACTC CTTTT CATGTT AATAATTTGAGGACTGGATAAAAGGTTTCAAGATTAAAATT 
TGATGTT CAAACCTTTGT 
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Figure 4: 5' cDNA fragment of human POSH (public gi:1043261 1; SEQ ID NO:4) 

ctgagagacactgcgagcggcgagcgcggtggggccgcatctgcatcagcc^ccgcagccgctgcggggc 
cgcgaacaaagaggaggagccgaggcgcgagagcaaagtctgaaatggatgttacatgagtcattttaag 
gatgcacacaactatgaacatttctgaagatttt-ttctcagtaaagtagataaagatggatgaatcagcc 
ttgttggatcttttggagtgtccggtgtgtctagagcgccttgatgcttctgcgaaggtcttgccttgcc 
agcatacgttttgcaagcgatgtttgctggggatcgtaggttctcgaaatgaactcagatgtcccgagtg 
caggactcttgttggctcgggtgtcgaggagcttcccagtaacatcttgctggtcagacttctggatggc 
atcaaacagaggccttggaaacctggtcctggtgggggaagtgggaccaactgcacaaatgcat.taaggt 
ctcagagcagcactgtggctaattgtagctcaaaagatctgcagagctcccagggcggacagcagcctcg 
ggtgcaatcctggagccccccagtgaggggtatacctcagttaccatgtgccaaagcgttatacaactat 
gaaggaaaagagcctggagaccttaaattcagcaaaggcgacatcatcattttgcgaagacaagtggatg 
aaaattggtaccatggggaagtcaatggaatccatggctttttccccaccaactttgtgcagattattaa 
accgttacctcagcccccacctcagtgcaaagcactttatgactttgaagtgaaagacaaggaagcagac 
aaagattgccttccatttgcaaaggatgatgttctgactgtgatccgaagagtggatgaaaactgggctg 
aaggaatgctggcagacaaaataggaatatttccaatttcatatgttgagtttaactcggctgctaagca 
gctgatagaatgggataagcctcctgtgccaggagttgatgctggagaatgttcctcggcagcagcccag 
agcagcactgccccaaagcactccgacaccaagaagaacaccaaaaagcggcactccttcacttccctca 
ctatggccaacaagtcctcccaggcatcccagaaccgccactccatggagatcagcccccctgtcctcat 
cagctccagcaaccccactgctgctgcacggatcagcgagctgtctgggctctcctgcagtgccccttct 
caggttcatataagtaccaccgggttaattgtgaccccgcccccaagcagcccagtgacaactggcccct 
cgtttactttcccatcagatgttccctaccaagctgcccttggaactttgaatcctcctcttccaccacc 
ccctctcctggctgccactgtccttgcctccacaccaccaggcgccaccgccgccgctgctgctgctgga 
afc9ggaccgaggcccatggcaggatccactgaccagattgcacatttacggccgcagactcgccccagtg 
tgtatgttgctatatatccatacactcctcggaaagaggatgaactagagctgagaaaaggggagatgtt 
tttagtgtttgagcgctgccaggatggctggttcaaagggacatccatgcataccagcaagataggggtt 
ttccctggcaattatgtggcaccagtcacaagggcggtgacaaatgcttcccaagctaaagtccctatgt 
ctacagctggccagacaagtcggggagtgaccatggtcagtccttccacggcaggagggcctgcccagaa 
gctccagggaaatggcgtggctgggagtcccagtgttgtccccgcagctgtggtatcagcagctcacatc 
cagacaagtcctcaggctaaggtcttgttgcacatgacggggcaaatgacagtcaaccaggcccgcaatg 
ctgtgaggacagttgcagcgcacaaccaggaacgccccacggcagcagtgacacccatccaggtacagaa 
tgccgccggcctcagccctgcatctgtgggcctgtcccatcactcgctggcctccccacaacctgcgcct 
ctgatgccaggctcagccacgcacactgctgccatcagtatcagtcgagccagtgcccctctggcctgtg 
cagcagctgctccactgacttccccaagcatcaccagtgcttctctggaggctgagcccagtggccggat 
agtgaccgttctccctggactccccacatctcctgacagtgcttcatcagcttgtgggaacagttcagca 
accaaaccagacaaggatagc 
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Figure 5: N terminus protein fragment of hPOSH (public gi:10432612; SEQ ID 
NO:5) 

MDBSAI^IJ^CPVCIiBRLDASAKVLPCQHTF 
RLLTCIKQRPWKPGPGGGSGTNCTNAIj^ 

AIiYNYEGKEPGDIiKFSKGDI i ilrrqvdenwyhgevngihgffptnfvqi ikpi*pqpppqckai*ydfbvk 

DKEADKDCIiPFAKDDVLTVIRRVDBNWAEGMIiADKIG I FP I S YVE FNSAAKQI*I EWDKPPVPGVBAGECS 
SAAAQSSTAPKHSOTKKNTKKRHSFTSLTMAN^ 

CSAPSQVHISTTGIjIVTPPPSSPVTTGPSFTFPSDVPYQAAIiGTIJsrPPLPPPPI^ 
AAAAGMGPRPI^GSTDQIAHLRPQTRPSVYVAIYPYTPRK^ 

SKI GVFPGNYVAP VTRAVTNAS QAKVPMSTAGQTS RG VTMVS P STAGGPAQKLQGNGVAGS PSWPAAW 
SAAHXQTS PQAKVIiLHMTGQMTVNQARNAVRTVAAHNQERPTAAVTP IQVQNAAGLSPASVGLSHHSIiAS 
PQPAPLMEKSSATHTAAJ^ISRASAPIiACAAAAPLT^ 
GNSSATKPDKDS 
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Figure 6: 3* mRNA fragment of hPOSH (public gi:7959248; SEQ ID NO:6) 

atttcatatgttgagtttaactcggctgctaagcagctgatagaatgggataagcctcctgtgccaggag 

ttgatgctggagaatgttcctcggcagcagcccagagcagcactgccccaaagcactccgacaccaagaa 

gaacaccaaaaagcggcactccttcacttcccfccactatggccaacaagtcctcccaggcatcccagaac 

cgccactccatggagatcagcccccctgtcctcatcagctccagcaaccccactgctgctgcacggatca 

gcgagctgtctgggctctcctgcagtgccccttctcaggttcatataagtaccaccgggttaattgtgac 

cccgcccccaagcagcccagtgacaactggcccctcgtttactttcccatcagatgttccctaccaagct 

gcccttggaactttgaatcctcctcttccaccaccccctctcctggctgccactgtccttgcctccacac 

caccaggcgccaccgccgctgctgctgctgctggaatgggaccgaggcccatggcaggatccactgacca 

gattgcacatttacggccgcagactcgccccagtgtgtatgttgctatatatccatacactcctcggaaa 

gaggatgaactagagctgagaaaaggggagatgtttttagtgtttgagcgctgccaggatggctggttca 

aagggacatccatgcataccagcaagataggggttttccctggcaattatgtggcaccagtcacaagggc 

ggtgacaaatgcttcccaagctaaagtccctatgtctacagctggccagacaagtcggggagtgaccatg 

gtcagtccttccacggcaggagggcctgcccagaagctccagggaaatggcgtggctgggagtcccagtg 

ttgtccccgcagctgtggtatcagcagctcacatccagacaagtcctcaggctaaggtcttgttgcacat 

gacggggcaaatgacagtcaaccaggcccgcaatgctgtgaggacagttgcagcgcacaaccaggaacgc 

cccacggcagcagtgacacccatccaggtacagaatgccgccggcctcagccctgcatctgtgggcctgt 

cccatcactcgctggcctccccacaacctgcgcctctgatgccaggctcagccacgcacactgctgccat 

cagtatcagtcgagccagtgcccctctggcctgtgcagcagctgctccactgacttccccaagcatcacc 

agtgcttctctggaggctgagcccagtggccggatagtgaccgttctccctggactccccacatctcctg 

acagtgcttcatcagcttgtgggaacagttcagcaaccaaaccagacaaggatagcaaaaaagaaaaaaa 

gggtttgttgaagttgctttctggcgcctccactaaacggaagccccgcgtgtctcctccagcatcgccc 

accctagaagtggagctgggcagtgcagagcttcctctccagggagcggtggggcccgaactgccaccag 

gaggtggccatggcagggcaggctcctgccctgtggacggggacggaccggtcacgactgcagtggcagg 

agcagccctggcccaggatgcttttcataggaaggcaagttccctggactccgcagttcccatcgctcca 

cctcctcgccaggcctgttcctccctgggtcctgtcttgaatgagtctagacctgtcgtttgtgaaaggc 

acagggtggtggtttcctatcctcctcagagtgaggcagaacttgaacttaaagaaggagatattgtgtt 

tgttcataaaaaacgagaggatggctggttcaaaggcacattacaacgtaatgggaaaactggccttttc 

ccaggaagctttgtggaaaacatatgaggagactgacactgaagaagcttaaaatcacttcacacaacaa 

agtagcacaaagcagtttaacagaaagagcacatttgtggacttccagatggtcaggagatgagcaaagg 

attggtatgtgactctgatgccccagcacagttaccccagcgagcagagtgaagaagatgtttgtgtggg 

ttttgttagtctggattcggatgtataaggtgtgccttgtactgtctgatttactacacagagaaacttt 

tttttttttttaagatatatgactaaaatggacaattgtttacaaggcttaactaatttatttgcttttt 

taaacttgaacttttcgtataatagatacgttctttggattatgattttaagaaattattaatttatgaa 

afcgataggtaaggagaagctggattatctcctgttgagagcaagagattcgttttgacatagagtgaatg 

cattttcccctctcctcctccctgctaccattatattttggggttatgttttgcttctttaagatagaaa 

tcccagttctctaatttggttttcttctttgggaaaccaaacatacaaatgaatcagtatcaattagggc 

ctggggtagagagacagaaacttgagagaagagaagttagtgattccctcfcctttctagtttggtaggaa 

tcaccctgaagacctagtcctcaatttaattgtgtgggtttttaattttcctagaatgaagtgactgaaa 

caatgagaaagaatacagcacaacccttgaacaaaatgtatttagaaatatatttagttttatagcagaa 

gcagctcaattgtttggttggaaagtaggggaaattgaagttgtagtcactgtctgagaatggctatgaa 

gcgtcatttcacattttaccccaactgacctgcatgcccaggacacaagtaaaacatttgtgagatagtg 

gtggtaagtgatgcactcgtgttaagtcaaaggctataagaaacactgtgaaaagttcatattcatccat 

tgtgattctttccccacgtcttgcatgtattactggattcccacagtaatatagactgtgcatggtgtgt 

atatttcattgcgatttcctgttaagatgagtttgtactcagaattgaccaattcaggaggtgtaaaaat 

aaacagtgttctcttctctaccccaaagccactactgaccaaggtctcttcagtgcactcgctccctctc 

tggctaaggcatgcattagccactacacaagtcattagtgaaagtggtcttttatgtcctcccagcagac 

agacatcaaggatgagttaaccaggagactactcctgtgactgtggagctcfcggaaggcttggtgggagt 

gaatttgcccacaccttacaattgtggcaggatccagaagagcctgtctttttatatccattccttgatg 

tcattggcctctcccaccgatttcattacggtgccacgcagtcatggatctgggtagtccggaaaacaaa 

aggagggaagacagcctggtaatgaataagatccttaccacagttttctcatgggaaatacataataaac 

cctttcatctttttttttttcctttaagaattaaaactgggaaatagaaacatgaactgaaaagtcttgc 

aatgacaagaggtttcatggtcttaaaaagatactttatatggttgaagatgaaatcattcctaaattaa 

ccttttttttaaaaaaaaacaatgt'atattatgttcctgtgtgttgaatttaaaaaaaaaaaatacttta 

cttggatattcatgtaatatataaaggtttggtgaaatgaactttagttaggaaaaagctggcatcagct 

ttcatctgtgtaagttgacaccaatgtgtcataatattctttattttgggaaattagtgtattttataaa 

aattttaaaaagaaaaaagactactacaggttaagataatttttttacctgtcttttctccatattttaa 

gctatgtgattgaagtacctctgttcatagtttcctggtataaagttggttaaaatttcatctgttaata 

gatcattaggtaatataatgtatgggttttctattggttttttgcagacagtagagggagattttgtaac 

aagggcttgttacacagtgatatggtaatgataaaattgcaatttafccactccttttcatgttaataatt 

tgaggactggataaaaggtttcaagattaaaatttgatgttcaaacctttgt 
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Figure 7: C terminus protein fragment of hPOSH (public gi:7959249; SEQ ID 
NO:7) 



RHSME I SPPVIiISSSNPTAAARI SELSGLSCSAPSQVHISTTGLI VTPPPSSPVTTGPSFTFPSDVPYQA 
ALGTLOTPLPPPPLLAATVLASTPPGATAAAAAAGM^ 

EDELE LRKGE M FL VFERCQDGWF KGT S MHTS KI G VFP GNYVAP VTRAVTNAS QAKVP M ST AGQTS RGVTM 
VS PSTAGGPAQKLQGNGVAGS PS WPAAWS AAH IQTS PQAKV^ R 
PTAAVTP I QVQNAAGLSPASVGLSHHSIiASPQPAPLMPGSATHTAAI S ISRASAPIACAAAAPIiTSPS IT 
SASLEAEPSGRIVTVLPGLPTSPDSASSACGNSSATKPDKDSKKEKKGIJjKIjLSGAST^ 
TLEVELGSABLPLQGAVGPEIiPPGGGHGRAGS CP VDGDG P VT T AVAGAALAQD AFHRKAS S LD S AVP IAP 
PPRQACS SLGPVLNE SRP WCERHRVWS YP PQSEAELBIiKEGD I VFVHKKREDGWFKGTIiQRNGKTGLF 
PGSFVENI 
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Figure 8: Human POSH fiill mRNA, Annotated Sequence (part 1) 

- gi| 10432611 | dbj | AK021429 . 1 1 AK021429 Homo sapiens cDNA 
FL0TH367 fis, clone HEMBA1000303 , highly similar to Mus musculus 
Plenty of SH3s (POSH) mRNA 

- gi | 7959248 | dbj | AB040927 . 1 | AB040927 Homo sapiens mRNA for 
KIAA1494 protein, partial cds 

WzSz - Both hPOSH and KIAA1495 

t&asgj - Ring Domain 

- SH3 Domian 

* start codon and stop codon of predicted ORF 

CTGAGAGACACTG CGAGCGGCGAGCG CGGTGGGG CCGCATCTG CATCAGCCGC CG CAGCC6CTGCGGGGC 
CG CGAACAAAGAGGAGGAG C CGAGG CGCGAGAG CAAAGTCTG AAATGGATGTT ACATGAGT CATTTTAAG 
GGAT GCACACAACT ATGAACATTTCTGAAGATTTTT TC TCAGTAAAGTAGATAAAC^SI§GATGAATCAGC 
CTTGTTGGATCTTTTGGA (^ &^&&^ 



^™^ GGCCTTGC ^ COTGGTCOTOTTGGG< ^ GT< ^ 

^TGCflA^TGGAixCCCC^ 




^^^^^^^ ^^^^^^^^^^^^^^^^^^^^OTGGAGCTGGGCA^^^^^^^^C 
r^™^ ACCGGTCACGACTGCAGTGGCAGGAG ^ G 



-to be continued 
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Figure 8: Human POSH foil mRNA, Annotated Sequence (part 2) 



TTGTGGACTTCCAGATGGTCAG G AGATGAGCAAAGGATTGGTATGTG ACT CTGATG C C C CAGCACAGTTA 
CCC CAG CK^GCAGAGTGAAGAAGATGTTTGTGTGGGT TTTG TTAGTCTGGATT CGGATGTATAAGGTGTG 
C CTTGTA CTGTCTGATTTACT ACA CAGAG AAACTTTTTTTTTTTT TT AAGAT ATATGA CTAAAATGGACA 
ATTGTTTACAAGGCTTAACTAATlTATTTGCTTTTTTAAACTTGAACTT^ 

TTGGATTATGATTTTAAGAAATT ATTAATTTATGAAATGATAGGTAAGGAGAAG CTGGATTATCTCCTGT 
TGAGAG CAAGAGATT CXjTTTTGACATAGAGTGAATGCAT TTT C CC CT CT C CT C C T C CCTGCTAC CATTAT 
ATTTTGGGGTTATGTTTTGCTTCTTTAAGATAGAAATCCCAGTT 

AAC CAAACATACAAATGAATCAG TATCAATTAGGG C CTCGGGTAG AGAGACAGAAACTTGAGAGAAGAG A 
AGTTAGTGATTCCCTCTCTTTCTAGTTTGGTAGGAATCACCCTGAAGACCTAG 

TGGGTTT TTAATTTT C AGAATGAAGTGACTGAAACAATGAGAAAGAAT ACAG CACAAC C CT TGAACAA 
AATGTAT TTAGAAATATATTTAGTTTTATAG CAGAAGCAGCTCAATTGTT TGGTTGGAAAGTAGGGGAAA 
TTG AAGT TGTAGT CACTG T CTGAGAATGG CTATGAAG CGT CATTT CACATTTTA CC C CAACTGA C CTG CA 
TG C C CAGGACACAAGTAAAACAT TTG TGAGATAGTGGTGGTAAGTGATGCACT CGTGTTAAGT CAAAGGC 
XATAAGAAACACTGTGAAAAGTTCATATTCATCCATTGTGATTCT 

GGATTCC CACAGTAATATAGACTGTGCATGGTGTGTATATTT CATTGCGATTT CCTGTTAAGATGAGTTT 
GTACT CAGAATTGAC CAATT CAGGAGGTGTAAAAATAAACAGTGTT CT CTTCT CTAC C CCAAAGCCACTA 
CTGACCAAGGTCTCTTCAGTGCACTCGCTCCCTCTCTGGCTAAGGCATGC^ 
^AGTGAAAGTGGTCTTTTATGTCCTCCCAGCAGACAGACATCAAGGATGAGTTAAC 
CTGTGACTGTGGAGCTCTGGAAGGCTTGGTGGGAGTGAATTTGCCCACA 
.CAGAAGAGCCTGTCTTTTTATATCC^^ 

CACXSCAGTCATGGATCTGGGTAGTCCGGAAAACAAAAGGAGGGAAGACAGCC^^ 

TT AC CACAGTTTTCT CATGGG AAAT A CATAATAAAC C CTTT CAT CTTTTTTTTTTT C CTTTAAGAAT TAA 
AACTGGGAAATAGAAACATGAACTGAAAA.GTCTTGCAATGACAAGAGGTTTCATGG 



AAATGAACTTTAGTTAGG AAAAAGCTG GCAT CAG CTTTCAT C TGTGTAAGTTGACACCAATGTGTCATAA 
GATAATTITTTTACCTGTCTTTTCTCCATATTTTAAGCTATGTGATO 

^^f^lji^^ CATTAGGTAATATAATGTATGGGTTTT CTAT 
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Figure 9: Domain Analysis of Human POSH 
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Figure 10: Diagram of Human POSH Nucleic Acids 
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■ 

Figure 1 1 : Reduction in Full Length POSH mRNA by siRNA Duplexes 
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Figure 12: POSH Affects Release of VLP from Cells 
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Figure 13: Release of VLP from Cells at Steady State 
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Figure 14: Mouse POSH mKNA sequence (public gi:10946921; SEQ ID NO: 8) 

GGGCAGCGGGCTCGG03GGGCTGCM 

GCGAGAGCAAAGTCTGAAATGGATGTTACATGAATCACTTTAAGGGCTC 

AAG CCGTTTT CT CACTAAAGTCA CTCAAGATGGATGAGT CTG CCTTGTTGGAC CTT CTGGAGTGC CCTGT 
GTGTCTAGAACGCCTGGATGCTTC03CAAAGGTCTTACCCTC CCAGCATAC C"i"iTlHSCAAACX3CIX3TTTG 
CTGGGGATTGTGGGTTC CCGGAATGAACT CAGATGT C CCGAATG C CGGACTCTTGTTG GCT CTGGGGTCG 
ACGAGCTCCCCAGTAACATCCTACTCGTCAGACTTCTGGAT^ 

CCCTGGTGGGGGCGG CGGGACCACCTGCACAAACACATT AAGGG CGCAGGGCAGCACTGTGGTTAATTGT 
GG CTCGAAAGAT CTGCAGAG CT C C CAGTGTG GACAGCAG C CT CGGGTG CAAGCCTGGAG C C C CCCAGTGA 
GGGGAATAC CTCAGTT AC CGTGTG C CAAAG CATT ATATAACTACGAAGGAAAAGAGC C CGGAGACCTTAA 
GTTCAGCAAAGGCX^CACCATCATTCTGCGCCGACAGGTGGATGAGAATO 
GGGGTCCACXX3CTTTTTCCCCACTAACTTCX3TGCAGATCA^ 
GCAAAGCACTTTACGACTTTGAAGTGAAAGACAAGGAAGCTGACAAAGA 

TOACGTACTGACCGTGAT CCXjCAGAGTGGATGAAAACTGGGCTGAAGG CTGGCAGATAAAATAGGA 
ATATTTC CAATTT CATACGTGGAGTTTAACT CAG CTG C CAAG CAG CTGAT AGAGTGGGAT AAG C CT C C CG 
TGCCAGGAGTGGACACXKSCAGAATGCCCCTC^Ga^CGGCGaVGA^ . 
CAC CAAGAAG AACAC CAGGAAG CGACACT C CTT CAC C T C CCT CAC CATGG C CAACAAGTC T T CC CAGGGG 
TCCGAGAACCGCCACTCCATGGAGATCAGCCCTCCTGTGCT 
CCCGCATCAGCGAACTGTCCGGGCTCTCCTGCAGCXSCCCC^^ 

AATTGTGACCCCACCCCCTAGCAGCCCGGTGACAACTGGCCCTGCGTTCACGTTCCOT 
TACCAAGCTGCCCTTGGAAGTATGAATCCTCCACOTCCCCC 

CCTCCACCCCGTCAGGCGCTACTGCTGCTGTTGCTGCTGCTGCTGCCGCCGCCGCCGCTC 

AC CCAGG C C TGTGATGGGGT C CT C TGAACAGATTG CACATTT ACGG C CTCAGACT CGT C C CAGTGTATAT 

GTTGCTATATATC CGTACACT CC CCGGAAGGAAGACGAACTGGAGCTGAGGAAAGGGGAGATC 

TGTTTGAG CGTTG C CAGGACGGCTGGTACAAAGGGACAT CGATG CAT AC CAGCAAGAT AGGCGTTTT C C C 

TGGCAACTATGTGGCGCCCX3TCACAAGGGCGGTGACGAATGCCT 

GCGGGT CAGGCAAGT CGCGGGGT GAC CATGGT CAGC C CTTCCACTGCAGGAGG AC CT ACACAGAAG C C C C 
AAGGAAACGGCGTGGCCGGAAATCCCAGCGTCGTCCCCACGGCTGTGGTGTCAGCAGCTCATATCCAGAC 
AAGTC CTCAGGCTAAGGTCCTG CTGCACATGT CTGGGCAGATGAC^GTCAATCAGGC CCGCAATG CTGTG 
AGGACAGTTG CAG CACATAG C CAGGAACG CC C CACAG CAGCAGTGAC T C C CAT CCAGGT C CAGAATGC CG 
CCTGCCTTGGTCCTGCATCCGTGGGCCTGCCCCATCATTCrCTGGCCrCCCAACOT 
GGGTCCTGCTGCCCACGGTGCTGCCGTCAGGATCAGT^ 

G CTTC T CTGGCCT C C C CAAAT AT GAC CAG TG C CATGTTGGAGACAGAGC C CAGTGGT CG CACAGTGAC CA 
TCCTCCCTGGACTCCCCAC^TCTCCAGAGAGTGCTGC^TC^ 

AGACAAGGA CAGT AAGAAAGAAAAAAAGGGC CTACTG AAG CTGCT TT CTGGTG C CTC CAC CAAACG CAAG 

CCCCXSAGTCTCCCCTCCAGCATC^CCTACCCTGGATGTGGAGCTGGGTGCTGGGGAGGCT 

GAGCAGT AGGT C CTGAG CTG C CG CTAGGGGG CAG C CACGGCAG AGTGGGGT CATGC C C CACAGATGGTGA 

TGGT C CAGTGG C CGC TGGAACAG CAG C CCTAG C C CAGGATG C CTT C CAC CGCAAG ACAAGCT C C CTGGAC 

TCCGCAGTGCCCATTGCTCCACCACCTCGCCAGGCCTGCTCCTCCCTGGGCCCAGTCATGAATGAGGCCC 

GGCCTGTTGTTTGTGAAAGG CACAGGGTGGTGGTTTCCTACC CT C CT CAGAGTGAGG CCGAACTTGAACT 

CAAGGAAGGAGATATTGTGTTTGTTCATAAGAAACGAGAGGACGGCTGGTTCAA 

AATGGGAAGACTGGC CTTTT C C CAG GGAG CTTTG TGGAAAACAT CTGAGAAGACGGGA CACG G AGAAAG C 
TTATCATCACACCACGTGTGACTAAAGAGCACAAAGCAGTTT CATAGAAAGAG CACAT CTpTGGACTTC C 
AGAT C TT CAAGAA C CGAG CAGAAGATGGG CA C CTGACT C CAG AG CC C CGG CCTGGTTACC C CAGGGG CAG 
AGGGAAGGAGGACACACCTGTGTGGGTTCCGTCTCTCTGGGTTCTGATGTG 

AGG CT TAACT AAT TTATTTG CT TTTT TAAAAC T TGAACTTT C T TG T AATAGCAAAT 
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Figure 15: Mouse POSH Protein sequence (Public gi: 10946922; SEQ ID NO: 9) 

MDESALLDIiloECPVCEiER^^ 
RIjLDGIKQRPWKPGPGGGGG^ 

ALYNYEGKEPGDIiKPSKGOTI ILRRQVDBN^rraGEVSGVHGFFPTNFVQI IKPLPQPPPQCKALYDFEVK. 
DKEAD KDCIiP FAKDD VLTVI RRVDBNWAEGMLADKI G I FP I S YVE FNS AAKQL IEWDKP P VPGVDTAE CP 
SATAQSTSASKHPDTKKNTRKRHSFTSLTMANKSSCXSSQNRHSl^ 
CSAPSQVHISTTGLIVTPPPSSPVTTGPAFTFPSDVPYQAALGSMOT 
VAAAAAAAAAAGMGPRPVMGSSEQIAHIiRPQTRPSyYVAIYPYTPRKEDEIiEL 

KGTSMHTSKI GVFPGNYVAPVTRAVTNASQAKVSMSTAGQAS RGVTMVSP STAGGPTQKPQGNGVAGNPS 

VVTTAWSAAHIOTSPQAKVLLHMSGQMTWQARNAV^ 

PHHSIiASQPLPPMAGPAAHGAAVSISRTNAPMAO^AGAS 

SAASACGNS SAG KPDKDS KKE KKGLLKIjLSGASTKRKPRVS P PAS PTLD VELGAGEAPLQGAVGPE IiPLG 
GSHGRVGSCPTDGDGPVAAGTAAIiAQDAFHRKTSSIiDSAVP I APPPRQACS SLGP VMNEARP WCERHRV 
VVS YPPQSEAELELKEGDI VFVHKBCREDGWFKGTIjQRNGKTGLFPGS FVENI 
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Figure 16: Drosophila melanogaster POSH mRNA sequence (public gi:17737480; 
SEQK>NO:10) 



CATTTGT ATC CGCTTGGC CACGAGCTTTGG CTG CACTTGGCAAACTT AATAAATTAAACATTGAATC CTG 
CCTATTGCAACGATAATAT AAT CTGATTTAGTG CATTAAGAACGACAAGTAG CGATTATAATAGTAGATT 
TTAGCATTTG AGCTAAATTTATTTC C CAAC CGCGTCTTGGGATTGCGTATG CGTGAGC CAGTACCTGCAT 
GTGTGTGTGTTTTGGAATGTGG C C C TGCACG AAATTCAAATAGTGAC CATC CTTGAGATTTTG CAT ACTG 
GCAAGATGGACGAGCACACGTTAAACGACCTGTTGGAG 

ATCGAAGGTGCTG C CATG C CAGCACAC CTTCTGC CGCAAATGCTTGCAGGAC3kTTGTGGCCAGTCAG CA C 
AAGTTG CGATGCCCGGAGTG C CG CAT C CTGGTCTCTTGCAAAATTGATGAG CTGC CTCCAAACGTCTTGC 
TGATG CG AAT CTT AGAAG G CATGAAA CAAAATGCAG C!AG CTGG (ZAAAGGAGAAGAAAAGGGAGAGGAGAC 
TGAAAGACAGCCGGAAAGGGCCAAACCTCAGCCGCCAGC^ 

CTCCAGCTGCAGTCACAT CAG CAATCTCATCAGC CGGCT CGTCACAAGCAACGTCGATTTCTACTCCCCC 

ACGCCTATGCCCTCTTTGACTTCGCCTCCGGTGAAGCCACCGATOTAAAGTTCAAGAAAGGGGAT 

ACTGATCAAGC^TCGCATCX^CAACAA 

AT CAACT ACGTCAAGG TAT CGGT T C CG CTGC C CATGC CG CAG TG CAT TGC CATGT ATGACTTTAAGATGG 
GG C C CAACGACGAGGAGGGAT G C CT CGAATTTAAGAAAAG CACTGTAAT ACAGGTAATG CGC CGAGTTGA 
T CATAATTGGGCAGAAGGACGAATTGG C CAGAC CAT CGGAAT CTT T C CAATAG CATT CGTTGAG CTGAAT 
GCAGCGGCCAAAAAGCTGTTGGACAGCGGGCTAC^CA 

GGCAG CGGG C CCTTC CT C CGGT T C CAGTTATTGAT CC CACGGTGGT CACGGAAT C CAGTT CGGGATCCT C 

CAATTCCACGCCGGGCAGC^GCAATTCAAGCTCC^C^TCC^^ 

ATCTCAC!TGCCGAATACCCCCCAACATGTAGTAGCTTCCGGATCX3^ 

GAGCAAAGGAGAAACGC CACT CA CTAAATGCTTTG CTGGGAGGAGGAGCT C CATTAAGTCTGCTGCAGAC 
CAACraCCATTCGGCTGAAATTCTTAGCCTGCCCCATGAACTAAGCCGCTTGGAAGTT 
GCTCTAAAACCCACGTCAGCC C CACAGACATCG CGTGTACTTAAGACCACTGTTCAGCAGCAGATGCAAC 
CGAATTTACC CTGGGGATACTTAGCCCTGTT CCCATACAAAC CACGC CAAA CGGATGAG CT GGAATTAAA 
AAAGGGTTGTGTTTACATTGTGACCGAACXIATGTGT^ 
ATGACTGGAGTGTTCCCGGGCAACTACCTGACGCCCCTC 

AATGGAAATATGT TC C C CAAAAT G CAGACGC C CAGATGGCA CAAGTA CAG CAG CAT C CAGTTGCAC CAGA 
TGTGCG^CTCAACAACATGCTGTCCATGGAACCGCCT 

ACGACCACCAGTTGCTCTGTGTGGTCGAAAC CAGTGGAGGCG CTGTT CAGCAGAAAATCGGAGCCCAAGC 
CTGAAACTGC CACAG C TT CGACTACG AG CAG CAGTT C CT CTGGAG CAGTGGGACTTATGAGGAGATTAAC 
T CACATGAAAACA CGCT C CAAAT CT C CGGGAG CGT C C TTG CAGCAAGTT C CGAAAGAAGCTATTAG CACA 
AATGTGG AATTTACAACAAAC C CAT CAG CTAAAT TG CAT CCAGTA CATGT AAGAT C CGG C T CGTG C C C C A 
GTCAGCTGCAGCAGAGTCAACCGCTCAATGAAACTCC!AGC^^ 

CCTACCCAAGCAGCTGCCTTC CGCTT CTAOTAACAGCGTTTCGTACGGATCGCAACGCGTGAAAGGAAGC 
AAGGAACGTC CTCACTTGATTTGCGCGAGACAATCAT TAGATG CAGCTACATTTCGCAGTATGTACAACA 
ATGCCGCGTCGCCGCCGCOVCCTACTACTTCCGTGGCCCC^GCH^TCTACG^ 

GATTCCTGGAGGTGGAGCG CAAT CCCAGTTG CATGCCAATATGATTATTGCAC CCAGCCATCGGAAGT CG 
C^<^GCCTAGATGCX3AGTC^TGTGCTGAGTC CCAGCAGCAATATGATCACGGAGG CGG C CATTAAGGCCA 
GCG CCACCACTAAGTCTC CTT ACTGCACGAGGGAAAGT CGATTCCGCTG CATTGTGCCGTATCCACCAAA 
CAGTGACATTGAACTAGAGCTAGATTTGGGCGACATTATCT 

TATAAGGGCACCCATGCCCGTAC CCACAAAACCGGGCTGTTC CCCG CCTC CTTTGTTGAAC CGGATTGTT 
AGG AAAGT TATGGTT CAAACTAG AATT TATTAAGCGAAATT C CAAATTACTTGTC TAAAAG GATT CAAT C 
GT CGGT CTATTCGGGCTTCCAAATACG CAAT CTCATATT TCT CTTTT CAAAAAAGAAACCGTTTTGTACT 
CTTCCAATCGAATGGGCAG CTCGCTOTTGTACTTTTTTATACAATGCTTGATCAAAAT AGG CTAGCCATG 
TAAGACTTAGGGAACAGTTAC TT AAG C CT TAG CGATTAGTTAGCTAGAGAAATAATCTAACCGATC CTTG 
TGCCCTCTACAAAGTTATTTGTAATATACGATACTCAGTAATA*^^ 
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Figure 17: Drosophila melanogaster POSH protein sequence (public gi:17737481- 
SEQIDNOill) 

^EHTLNDIjLECSVCIjERIjDTTSKVIjPC^ 

RI LEGMKQNAAAGKGEE KGBETETQPERAKPQPPABSV^PDNQLIjQIjQSHQQSHQPARHKQRRFLLPHA. 

YALFDPASGEATDLKFKKGDL 1 1» I KHRI DNNWFVGQANGQEGTFP INYVKVSVPLPMPQCIAMYDFKMGP 

NDEEGCIjEFKICSTVIQVMRRVDHNWAEGRIGQTIGIFPIAF^ 

RALPPVPVIDPTVVTBSSSGSSNSTOGSSNSSSTSSSl^CSPNHQISLPOTPQB^^ 

KEKRHSIiNALLGGGAPLSLLQTNRHSAE I LSLPHELSRLEVS SSTALKPTSAPQTSRVLKTTVQQQMQPN 

LPWGYIiAIiFPYK^RQTDEIjEIiKKGCVYrVTERC^n>^ 

KYVPQNADAQMAQVQQHPVAPDVRLNNMLSMQPPDLPPRQQQATATTT 

TATASTTSSSSSGAVGIiMRRIiTHMKTRSKSPGASIiQQVPKEAISTNTO . 

IiQHBQPLNETPAAKTAAQQQQFLPKQLPSASTNSVSYGSQ 

ASPPPPTTSVAPAVYAGGQQQVIPGGGAQSQIiHANM^ 

TTKS P YCTRE SRFRC I VP YP PNSD I ELELHLGD 1 1 YVQRKQKNGWYKGTHARTHKTGL FP AS FVE PDC 
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Figure 18: POSH Domain Analysis 
hPOSH protein sequence : 
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Figure 19: Human POSH has ubiquitin Iigase activity 
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Figure 21. PLD activity in medium of transfected cells 
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Figure 22. 
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Figure 23. 
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Figure 24. 
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Figure 25. 

H187-2 H153-1 
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FIGURE 26 
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Figure 27. 
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Figure 28. 
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Figure 29A. 
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Figure 29B. 
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Figure 30. Putative PKA phosphorylation sites in hPOSEL 



MDESAIjIiDIjLECPVCI,ERIJXA£AKVI,PC^^ 
RLLTCIKQRPWKPGPGGGSGTNCTNALRS^ 

ALYNYBGKEPGDIJtFSKGDIIILRRQVDENWYHGEVNGIHGFFPTN^ 

DKEADKD CLP F AKDDVIjT^^RRVDK^AEGMIiADK I G I FP I S YVE FNSAAKQIiI EWDKPPVPGVDAGE CS 
SAAAQSSTAPKHSDT pKKNj^ 

s " ' " ~* 

CSAPSQVHISTTGIjIVTPPPSSPVTTGPSFTFPSDVPYQAAI^^ 

AAAAGMGPRPMAGSTDQIAHLRPQTRPSVYVAIYPYTPRX^^ 

SKJGVFPGNYVAPVTRAVTNASQAKVPMSTAGQTSRGVTjWSPSTAG 

SAAH I QTS PQAKVLLHMTGQMTVNQARNAWTVAAHNQERPTAAVTP IQVQNAAGLS P AS VGLSHHSLAS 

PQPAPIiMPGSATHTAAISISRASAPLACAAAAPLTSPSITSASLEAEPS GRIVTV IiPGIiPTSPDSASSAC 

GNSSATKPDKDSKKEKKGLLKIiIiSGASTra 

(^CPVDGDGPVTTAVAGAAIiAQDAF HRKAi siiDSA^ 

PPQSElVELELKEGDIVFVHKKREDGWFKGTLQRNGKTGriFPGSFVENI 
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Figure 31. Phosphorylation of KPOSH regulates binding of GTP-loaded Rac-1 . 
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Figure 32. 
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Figure 34B. 
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Figure 35. 



Soiutioa No, Mock 



8 



PreSZ 




Dire « 



a Care 




-2 US 



45/202 



WO 2004/078130 
FIGtJRS 36 



PCI7US2004/006308 



Unigene Name; Arfl Unigene IDs Hs. 286221 

Human Arfl raRNA sequence - varl (public gi: 3360490) (SEQ ID NO: 325) 

GCAAAACCAACGCCTGGCTOSGAGCA^ 

CAAGCAira^AACATCTTCGCCAACCTCTTC^ 

GGTGGGCCTGGATGCTGCAGGGAAGACC^CGATCCTCTA^ 

ATTCCCACCATAGGCTTCAACGTGGAAACCGT^^ 

GTGGCC^GGACAAGATCCGGCCCCTGTGGO^ 

GGACAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGC^^ 

CTCCGGGATGCTGTCCTCCTGGTGTTCGCCAACAAGCA^ 

TCACAGACAAGCTGGGGCTGCACTCACTACGCC^ 

CGGCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCA^ 

CCCTCCXrrCTCACTCCTCTTGCCCT 

CC^GAAGCTGCCTCCGTGGTTTGGTCA^ 

GCGGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGGCA 
ATATTACTCAGCTTTTTTTATTGTAAAAAGAAAAATGAACTCA 

CCCATGGGCACCTGGCCTCGAGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCTGT 

GTTGAAATCCATTTTGGTGGTTGGTTTTTAATC 

AGTCGAGAAC^CTTGAACACACAGAAGGGAGACCCCGCCTAG 

GAGTCGCCAGCCCAGCTGTTC^ 

GCATGGTCACAGTAGAGATCCCCGCAACTCGCTTGTC^ 
" TGTCCCTGTGCTCCCAGGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCC 
GCCCTACCCACCTTCAGGCAGCCTATGGGACGCAGGGCCCCATCT 
GTGGGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAG^ 
GGAGCGCGTCCAAG£CGGGCAGGCGGTCCACCTAGACCCA^ 
GTGATGTAGCTTTCTCTCCCTCAGCCTGC^ 

TTCTTTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCA 

CTATTTGGTGTCXSTGGAACCTCTTACTGCTTTC^^ 

AAAAAAA 

Human Arfl mJRLNA sequence- var2 (public gi: 30583624) (SEQ ID NO: 326) 

ATGGGGAACATCTTCGCCAACCTCTTCAAGGGCCTT^ 

GCCTGGATGCTGCAGGGAA<3ACCACX3ATCCTCT^ 

CACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAA^ 

caggacAagatccggcccctgtggcgccactacttcca^ 
gc^tgacagagagcgtgtgaacgaggcccg'tgaggagctc^ 
ggatgk^tgtcctcctggtgttcgccaag 
gacaagctggggctgcactca'ctacgccacag 

acgggctctatgaaggactggactggctgtccaatcagctccggaaccagaagtag 

Human Arfl rrJRNA sequence - var3 (public gi: 34527605) (SEQ ID NO: 327) 

AJUVACCAACGCCTGGCTCGGAGCAGCAGCCTCTC^ 

agcatggggaacatcttcgccaacct 

tgggcctggatgctgcagggaagaccacgatcot 

tccg&ccataggctocaacgtggaaaccgtggagt 

ggccagrgaczagatccggcccctrgtggcgccactact 

acagc^^tgac^gagagcgtgtgaacgaggcccgtgaggagct^ 

ccgggatgctgtcctcctggtgttcgccaacaagcaggacctccccaacgccatgaatgcggccgagatc 

AC^GACAAGCTGGGGCTGCACTCACTACGCCACAGGAACT 

GO^CGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGOTCCGGAACCAGAAGTGAA 

CTCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCT 

AGAAGCTGCCTGCGTGGTTTGGTCACCGTGTGC^TCG^ 

GGCCAGGCTTTTTATTTMTGTAAATAGTT^ 

ATTACTCAGCrrTTTTTATTGTAAAAAGAAAAATCAA.CTCACT^ 

ATGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCTGTGTT 
GAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATTTTTTAAAATAGTTAAGAATCCAAGT. 
CGAGAACACTTGAACACACAGAAGGGAGACCCCGCCTAGCATAGATTTGCAGTTACGGCCTGGATGCCAG 
TCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATCAATTCTGCA 
TGGTCACAGTAGAGATCCCCGCAACTCGCTTGTCCTTGGGTCACCCTGCATTCCATAGCCATGTGCTTGT 
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CCCTGTGCTCCCACGGTTCCCAGGGGCCAGGCTCGGAGCC 
CTACCCACCTTCAGGGAGCCTATGGGACGCAGGGCCCCATC 
GGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGAGACTTC 
GCX3CGTGCAAGCCX3GGCAGGC(X»^ 

ATGTAGCTT^CTCTCCCTCAGCCTGCAAGGGTCCGATTTeCCATCGA 

TTTTGTATTTTGATAAACACTGAAGCTGGAGCTGTT^^ 

TTTGGTGTCGTGGAACCTCTTACTGCITTCAATA 

CAGTTTTCATTTCGACAAACAAGCACTGTAATTATAGCTATTAGAATA 

Human Arfl mRNA sequence - var4 (public gi: 6995997) (SEQ ID NO: 328) 

GCAAAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTC 

CAAGCATGGGGAACATCTTCGCCAACCTCTTCAA^ 

GGTGGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCITAA 

ATTCCCACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATCAGCTTCACTGTGTGGGAC^ 
GTGGCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACT 

GGACAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGA 

CTCCGGGATGCTGTCCTCCTGGTGTTCGC 

TC^CAGACAAGCTGGGGCTGCACTCACTACGCCACAGGAACT 

CGGCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAAT^ 

CCCTCCCTCTC^CTCCTCTTGCCCTCTGCTTT^ 

CCAGAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATC^ 

GCGGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTT^ 

ATATTACTCAGCTTTTTTTATTGTA^ 

CCCATGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCC^ 

TTGAAATCCATTTTGGTGGTTGGTTTTAACCCAAACTCAGTGCATTTTTTAM 

TCGAGAACACTTGAACAGACAGAAGGGAGACCCCGCCTA 

GTCGCCAGCCCAGCTGTTCCCCTCGGGAAG^^ 

ATGGTCACAGTAGAGATCCCCGCAACTCXSCTTGTCC^ 

TCCCTGTGCTCCCACGGTTCdCTVGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGC 

CCTACCCACCTTCAGGCAGCCTATGGGACGCAGGCCCCATCTGTCCCTCGGTCCGCGTGTGGCCAGAGTG 

GTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACH^TGGCAGGATGT 

CGCGTGCAAGCOTGGCAGGCGGTCCACCTAGACCCACAGCC 

TGTAGCTTTCTCTCCCTCAGCCTGGAAGGGTC 

TTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCAGAACTGGTCTAT 
TTGGTGTCGTAGGAACCTCTTAC^GCTTTCAATACACGATTAGTAATC^ACTGTTTTGTATACTTGTTTT 
CAGTTTT CATTTCGACAAACAAGCACTGTAATTATAG CTATTAGAATAAAATCT CTTAACTATT 

Human Arfl mKNA sequence - var5 (public gi: 7020834) (SEQ ID NO: 329) 

CCTTACCCGGCGTGCCCCGCGCCCGGAGGCGCTGACGTGGCCGCCGTCAGAGCCGCCATCTTGTGGGAGC 

AAAACCJ^CGCCTGGCTCGGAGCAGGAGCCTCTiGA 

AGCATGGGGAACATCTTCGCCAACCTCTTGAAGGGCCTTTTT^ 

TGGGCCTGGATGCTGCAGGGAAGACCAGGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTGACCACCAT 
TCCCACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATC^ 

GGCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTTCGTGGTGG 
ACAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACGAGCT 
CCGGGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAACGCCATGA 
ACAGACAAGCTGGGGCTGCACTCACTACGCCACAGGAACTGGTAC^ 

GCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAAC(^GAAGTGAACGCGACCCCC 
CTCCCTCTCACTCCTCTTGCCCTCTGCTT^ 

AGAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGC 
GGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTTTCC^TGAGGC^GTTTCTGGTACTCCTATGCAAT 

CATGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCTGTGT 
TGAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTC^ 

TCGAGAACACTTGAACACACAGAAjGGGAGACCCCGCCTAGCATAGATTTGCAG 

GTCGCCAGCCCAGCTGTTCCCCTCGGGAACATC 

ATGGTCACAGTAGAGATCCCCGCAACT03CTTGTCCTTC 

TCCCTGTGCTCCCACGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGC 

CCTACCCACCTTCAGGCAGCCTATGGGACGCAGGGCCCCATCTGTCCCTCGGTCGCCGTGTGGCC^^ 

GGGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTT^ 

AGCGCGTGCAAGCCGGGCAGGCGGTCCACCTAGACCCAC^GCCCCTCGGGAGCACCCCACCTCTGTGTGT 
GATGTAGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAAAGACAACCTCTACTTTTTT 
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CTTTTGTATTTTGATAAACACTG^ 

ATTTGGTGTCGTGGAACCTCTTACTGCTTTCAATAC^CGATTAGTAATCAAC 

TCAGTTTTCATTTCGACAAACAAGCACTGTAATTATAGCTATO 

AAAAAAAAAAAAAAAAAAAAA 

Human Arfl mRNA sequence - var6 (pubHc gi: 10435849) (SEQ ID NO: 330) 

AGCTCAGTGCCCAGCATGTCTGTGGTGAGTGTGT 

CTCTCTTCCTGGGTTCTTGCCACTCCCCTGAA 

GTGTGCGCTGACCATGCTGAGATGGGCACTGTGGACTCAGCCTOT 

CATGTGAGGTACAGGGGAACGCACTGCTAGCAGATGGTTGGG 

TTGGTGCTGTGCCATCGCACAGTCATTCGC^ 

GGCCCCTGGGAGTGCCTGCCCTCAATTTGGAAGAGCCCTTGGGCACAGC 
ACTGGGCCATGATCC^GGGCATTGGGACCTCACCTAGGA 

CAGGGTCTCCCCTGTGGGCACCAAAOTGACCTCAAACTGCTGGTTCTTTGGCCCTG 
TGAAGTACTCTCCCXSGCAGCTOTCACCTGCAGGGAGA 
TCCTCGTTGCAGTGTTAAATCCCTGCAGGTTCCrTATTCT 
GCTAATGTGGGTTTGCGTTTTTGGTCTTGGTTT^ 

CAGGTAGAGGCCAACTGCACCCTGCCTGCC^GAGTAGAAATACTGGTAGGCCCCAGGCTCTGCTGCCCCT 

TCCATGTCCTTGTGTAAGCATCCATGGACAAAGCTGACTCAC^ 

GAAACAGGGGTTTTATTCTAGAGGGCCTTGTGCTC^^ 

GGCTGGGGCAGCACAAAGGACTGGATAGCATTTGCCA.TGATGCCATGTG^ 

ATTGTAGCTGTGGTCAGAGGTCATGAGACACTGCCTTCAGCAGCCCTGGGAGTCCACCTGGTGTGTGCT^ 
AGAGCTGTGCATCTGCAGATTTCAGAAGGACTTACGTTTGG 

TACCAAGAAGCAAGAAATAGACAAATAAGCAGGTAATGGTTCTTTGGTGTTTACA 

CGGTTCTTTGGTGTTCACATTAGCTATAGTCCCAGAACTCAGTCCATGAGGTGGAATC^CAAAAATGGAA 

TTCATTTCTGGCTGTCAGTACACA/^CTGATTTAAGATATCACCTT 

TCTAAACTGAATTTCACTGATTGCCCCA 

AGGCCAGCAGCAGATGTGTTAGAGGGGACCCTTGTGCCTCGCAGCCCTCATCT 
CACTGTGTCGAGTTGTAATGCCTAATGAGCTCCTCTA 

GACTCCCAGTGGAACTCGCCTTCAGATGCAGCCCAGAATAAGAGTTCTAGAAT^ 

GTCTCAATCTGCATGATTGCAAGTCTCTTCAACATGATTGGGTGCGTGGAGTGTCTCGGTCATGTGTCTT 
CCCCTCTGAGCATGCCTTTTGATTCGCACCTGTGTC^CAAT^ 

TCACCAGTATCGGCAGATTTAGTTTTCCCTTTACGTGAGTTTTGGTAAAATAGTGACAAA 

GTGCTCAGTCACAGAAAAATGTCAGGCCTACAGAAATGGAGCATTTGGCTGGTGGGTAGCGTGATGACCA 

TAGGCTTTATTTGGCTGGTGTGGTAAACAAGCAGCAGCTTGTGC^ 

TTTCATTTTAAGGACTCCCTTAAAATGAAAATCTTCGTGTGGGACATGAACACAQGCTTTCACG 

ATCATCTACACTATATGTATGACTGTTGAAAGGCTGTTGTTCCTCAGAAATTCTTAAAATGTTATGTAAT 

GTACATGAGTCCCTTCAGGAAGTGATCAGCTTTGTTCAG^ 

ATGAACTATAAAGATGTGTGTAATTTATCTGTCAGTGAACTTGACOTTAATAAAAGCrTT 
ACTCTGGGTGGGGTGCATTGGCTCACACACATAGTCCCAACT 

TGGAGCCCAAGAGTTp^GATCAGCCTGGG<^GGATA6CGAGACCCTGTCTATAGAAAATATTAAAAATC 
AGCTAGGCATGGTGGCTTGCCCTTGCATTCCCT^ 

CCAGGAGCTCAAGGCTGCAATGGGCTGTGATCGAACCACTGAATTCCAACCTGGGTGACAGAGTGAGGCC 
CTGTCTCAAAAAGAGAACTCTCGATGTCACTGGCTTTCCATGTJ^ 

TCGTGGATGTCAGTCAGCAGAAC^GAATCTTGGACCTG'GAGCTTGTTTGTCCTGTGCTAGAGGTTGGAGG 

TGTCTCTGTCTTTCTGTTGGTTCCTGTCAGTTCAGGTGA.CTTAGAGATTCTGTTACATACACC^GCTCTG 

ACAGGTTGGGGGAGATGATCAACCTTCCGCCTGCGCCTGTTCCCTTCCCTGACTCATGCCAAAGTATCCC 

TGAGATCTGCAA.GGGACCGAGGACAGTACTGC^TGGTGGTCTGGGTACAGGCCACAGAGGC^ 

CCATGTGCATCTGGACCAGTTTGGTTGGATCCATTGATGGACACAAAACGGATGTG 

CATTTTCTCCCTGCCCCTGTTCAGGCACAGTGAGGTGTCGGGGAATGTAGGTGCCAGAGTTGACTGTCCC 
GTTCTTTGGTGTAATGCCTGAAGGCCACCTTTACCATTGGTCTGTGGTCCTCACTGAAGAAAGAAACATT 
CTTCCTAAAAGACTTTTTTTCCTCAGAGTTGGAGCCCACAGCGTGGTCAGGAAAGAGAAGTAGCCACTGG 
TGGCTCCTGGCATCCTCCTGCTGGGCAGCCCCTTCTCAAAGTGTG 

AAGGCTCTGAGAAAGTCAGGTTTGCTCCTACCACAGGATAATTCCGATGAACCTGAAAAGCGGGTTTTGG 
CTTGTGTGGAGGGACTCTGGTGGAAGATU^GGGTGACAGCACCTGGCCTGGGCATGACACAAGTTAGGACC 
CGTACCAAGAGGCCCTGGAATTGAGGGTGGGGGTTGCTGTGGACTCTTTCTCCCTCTTAGGAAACTCTAT 
TGGGTCTCCATCTGTCACAGAAGCAGTAAATGATGTAGGGGCTGCCAGGTATAGGGTCCTGTGGGGATGC 
TGGAACATGCCGAGGCAGGACGTGCCAGCCACCCTCTGCCCATATGTGCAGCAGGGCCACAGATGTGCTT 
GTCGGTAGGAGAGACCAAGCTGTCTGTGTGCCGATGTCTTGACACCTGAGACTTGAGGTTCACCCATCCT 
GGTTCTGCCATTCCATTGCAGGGTGGCTTCCCTCCTTTGGGGACTCTTAACGCTTTGGTCTGTTAAAAAA 
AAAAAAAAAAAAAAATCCGGGCGTGGTGGCTCACTCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGG 
CTGATCATCTGAGGTCAGGGGTTCGAGGCCAGCCCTGACCAACATGGTGAAACCCCGTCTCTACT 
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Human Arfl raRNA sequence - var7 (public gi: 14714585) (SEQ ID NO: 331) 

CAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCC 
GGGGAACATCTTCGCCAACCTCTTCAAGGGCCTTTTO 
CTGGATGCTGCAGGGAAGACCACGATCCTCTACT^GCT 
CCATAGGCTTCAACGTGGAAACOJTGGAGTACAAGAA^^ 

GGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTT 
AATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTGATG^ 
ATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCT 
CAAGCTGGGGCTGCACTGACTACGCCACAGGAAOT 

GGGCTCTATGAAGGACTGGACTGGCTGTCCAATC^GCTCCGGAACGAGAAGTGA 

TCTC^CTCCTCTTGCCCTCrGCTTTACT 

CTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACC^ 

GGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGGCAGTTTC 

TCAGCTTTTTTTATTGTAAAAAGAAAAATCAACTC^ 

GCACCTGGCCTCmGGAGTCGCTGTGTTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCT 
TCCATTTTGGTGGTTGGTTTTTAACCCAAACTC 

AACACTTGAACACACAGAAGGGAGACCCCGCCTAGCATAGATTTGCAG 

CAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGC^GCAGACTGCGATCAATTCTGCATGG 

CACAGTAGAGATCCCCGCAACTCGCTTGTCCTTGGGTCACC 

GTGCTCCCACGGTTCCCAGGGGCCAGKjCTGGGA^ 

CCACCTTCAGGCAGCCTATGGGACGGAGGGCCCCATCTGTCCCTCGGTCGCCGTGTGGCCAGAGTGGGTC 
CGTCGTCCCCAAGACTCGTGCTCGCTCAGACACT 

GTGCAAGCCGGGCAGGCK3GTCCACCTAGACCC^CAGCCCCTCGGGAGCACCCCACCTCTGTGTGTGATGT 

AGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAAAGACAACCTCTACTTTTTTCTTTT 

GTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCAGAACTGGTCTATTTG 

GTGTCGTGGAACCTCTTACTGCTTTC^TACA^ 

TTTCATTTCGACAAACAAGCACTGTAATTATAGCTAT^^ 

AAAAAAAAAAAAAAAAAAAAA 

Human Arfl mRNA sequence - var8 (public gi: 33872952) (SEQ ID NO: 332) 

GTCCAATCAGCTCCGGAACCAGAAGTGAACGCGACCCCCCTCCCTCTCACTC 

CTCTCATGTGGCAAACGTGCGGCTCGTGGTGTGAGTGCCAGAAGCTC 

GCATCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGCGGCGAGGCTT^ 

TTGTTTCCAATGAGGCAGTTTCTGGTACTCCTATG'C^^TATTACTC^GCTTTTTTTATTGTAAAAAGAAA 
AATGAACTCACTGTTCAGTGCTGAGAGGGGATGTAGGCCCATGC^ 

TTGGGAGAGCCGGCCACGCCCTTGGCTTTAGAGCTGTGTTGAAATCCATTTTGGTGGTTGGTTTTTAACC 
CAAACTCAGTGCATTTTTTAAAATAGTTAAGAATCCAAGTCGAGAACACTTGA^ 

CCCGCCTAGCATAGATTTGCAGTTACGGCCTGGATGCCAGTCGCCAGCCCAGCTGTTCCCCTCGGGAACA 
TGAGGTGGTGGTGGCGCAGCAGACTGCGATCAATTCTGCATGGTCACAGTAGAGATCCCCGCAACTCGCT 
TGTCCTTGGGTGACCCTGCATTCCATAGCCAT 

GCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGCCCTACCCACCTTCAGGCAGCCTATGGGACGC 
AGGGCCCC^TCTGTCCCTCGGTCGCCGTGTGGCCAGAGTGGGTCCGTCGTCCCC^C^TCGTGCTCXSCT 
CAGACACTTTGGCAGGATGTCTGGGGCCTCACCAGCAGGAGCGCGTGCAAGCCGGGCAGGCGGTCCACCT 
AGACCCTVCi^CCCCTCGGGAGCACCCCACCTCTGTGTGTGATGTAGCTTTCTCTCCCTCAGCCTGCAAGG 
GTCCGATTTGCCATCGAAAAAGACAACCTCTACITTTTT 

GCTGTTAAATTTATCTTGGGGAAACCTCAGAACTGGTCTATTTGGTGTCGTGGAACCTCTTACTGCTTTC 
AATACACGATTAGTAATCAACTGTTTTGTATACTTGTTTTCAG 
ATTATAGCTATTAGAATAAAATCTCTTAACTATTAiu^AAAAAAAAAAAAAAA ' 

Human Arfl mRNA sequence - var9 (public gi: 15030200) (SEQ ID NO: 333) 

GAGCCGCCATCTTGTGGGAGCAAAACCAACGCCrGGCT 
GTGTCCTTCCACCTGTCCACAAGCATGGGGAACATC 

AAGAAATGCGCATCCTCATGGTGGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGCT 

GGGTGAGATCGTGACCACCATTCCCACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATCAGC 

TTCACTGTGTGGGACGTGGGTGGCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACAC 

AAGGCCTGATCTTCGTGGTGGACAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAG 

GATGCTGGCCGAGGACGAGCTCCGGGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAAC 

GCCATGAATGCGGCCGAGATCACAGACAAGCTGGGGCTGCACTCACTACGCCAQAGGAACTGGTACATTC 

AGGCCACOTGCGCCACCAGCGGCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAA 

CCAGAAGTGAACGCGACCCCCCTCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCrCA 

GCGGCTCGTGGTGTGAGTGCCAGAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTA 
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TTTCTGGTACTCCTATGCAATATTACTCAGCT 

TGCTCAGAGGGGATGTAGGCCCATGGGCAC^ 

CCCTTGGCTTTAGAGCTGTGTTGAAATCCAT^ 

TAAAATAGTTAAGAATCCAAGTCGAGAACACTTC 

GCAGTTACGGCCTGGATGCCAGTCGCCAGCCCAGCTC 

GCAGACTGCGATC^UVITCTGCATGGTCACA^ 

CATTCCATAGCCATGTGCTTGTCCXrrGTGCTCCCACGGTTCCCAGGGGCCA 

CCCCACTATGCCGCAGGCCGCCCTACCCACCTTC 

CGGTOSCCGTGTGGCCAGAGTGGGTCTCTCG^^^ 

GTCTGGGGCOTCACCAGCAGGAGCGCGTGCAAGCCGGG<^GGCGGTCCACOT 

GAGCACCCC^CCTCTGTGTGTGATGTAGCT^ 

AAAGACAACCTCTACTTTTTTCTTTTGTATTTTGATAAACA 

GGGAAACCTCAGAACTGGTCTATTTGGTGTCGTGGAACCTCTTACT^ 

AACTGTTTTGTATACTTGTTTTCAGTTTTCATTTCGACAAACAA^ 

AAATCTCTTAACTATTAAAAAAAAAAAAAAAAAA 

Human Arfl mRNA sequence - varlO. (public gi: 16553846) (SEQ n> NO: 334) 

GTGGGAGCAAAACCAACX3CCTGGCTCGGAGCAGCAGCCT 
TGTCCACAAGCATGGGGAACATCITCGCCAACCT 

CCTCATGGTGGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTG 

ACCACCATTCCCACCATAGGCTTCAACGTGGAAA'CCGTG^ 

ACGTGGGTGGCCAGGACAAGATCCGGCCCCTGTGGCGCCA^^ 

CGTGGTGGAC^GCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAG 

GACGAGCTCCGGGATGCTGTCCTCCTGGTGTTCGCCAACA 

CCGAGATCACAGACAAGCTGGGGCTGCACrCACTACGCCA 

CACCAGCGGCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGAACG 
CGACCCCCCTCCCTCTCACTCCTCTTGCCCTOTGCTTTACTCT(^TGTGGCAAACGTGCGGCTCGTGGTG 
TGAGTGCCAGAAGCTGCCTCCGTGGTTTGGTGACCGTGTGC^ 

CAGCOTGCGGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGGCAGTTTCr 

TATGC^TATTACTCAGCTTTTTTTATTGTAAAAAGAAAAATC 

TGTAGGCCCATGGGCACCTGCXTCTCCAGGAGTCGCTGTGT^ 

AGCTGTGTTGAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATTTTTTAA 
AATCCAAGTCGAGAAC^CTTGAACACACAGAAGGGAGACCCCGCCT 

GGATGC^GTCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATC 

AATTCTGCATGGTCACAGTAGAGATCCCCGCAACTCGCT 

TGTGCTTGTCCCTGTGCTCCCACGGTTCCCAGGGGCCAGG^^ 

CAGGCCGTCCTACCCACCTTCAGGCAGCCTATGGGACGCAGGGCCCCATCTGTC 

GCCAGAGTGGGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGAC^ 

CCAGCAGGAGCGCGTGCAAGCCGGGCAGGCGGTCCACCTAGACCC^CAGCCCCTCGGGAGCACCCCACCT 

CTGTGTGTGATGTAGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAAAGACAACCTCr 

ACTTTTTTCTTTTGTATTTTGATAAACACTGAAGCTGGAGCTC 

ACTGGTCTATTTGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTA^ 

ACTTGTTTTCAGTTTTCATTTCGACAAACAAGCACTGTAATTATAGCTATTAGAATAAA^ 

ATT 

Human Arfl mRNA sequence - varl 1 (public gi: 16553 799) (SEQ ID NO: 335) 

aaccaacgcctggctcggagcagcagcctctgaggtgtccctggc 

catggggaac^tcttosccaacctcttcaagggccttto^ 

ggcctggatgctgcagggaagaccacgatcctctacaagcttaagctgggtgagatcgtgaccaccattc 

ccacc^taggcttcaacgtggaaaccgtggagtacaagaacatcagcttcactgtgtgggacgtgg 

ccaggacaagatccggcccotgtggcgccactacttccagaacacacaaggcctgatct 

agcaatgacagagagcx3tgtgaacx1aggcccgtgaggagctcatgaggatgctggccgaggac 

gggatgctgtcctcctggtgttcgccaac^gcaggacctrc^ 

agacaagctggggctgcactcactacgccacaggaactggtacattcaggccacct 

gacgggctctatgaaggactggactggctgtccaatcagctccggaaccagaagtgaacgcgacccccct 

ccctctcactcctcttgccctctgctttactctcatgtggcaaacgtgcggctcgtggtgtgagtgcc^ 

aagctgcctccgtggtttggtcaccgtgtgcatcgcaccgtgctgtaaatgtggcagacgcagcct 

ccaggctttttatttaatgtaaatagtttttgtttccaatgaggcagtttctggtactcctatgcaatat 

tactcagctttttttattgtaaaaagaaaaatcaactc^ 

tgggcacctggcctccaggagtcgctgtgttgggagaggcggccacgcccttggctttagagctgtgttg 

aaatccattttggtggttggtttttaaccc^aact<^gtgcattttttaaaatagttaagaatc 

gagaaga.cttgaacagagagaagggagaccccgcctagcata 
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CGCCAGCCCAGCTGTTCCCCTCG^ 

GGTCACAGTAGAGATCCCCGCAACTCGCTTO^ 

CCTGTGCTCCC^CGGTTCCCAGGGGCCATO 

TACCCACCTTGAGGCAGCCTATGGGACGG^ 

GTCCGTCGTCCCCAACACTCGTGCTCGCTC^GACACTTT 

CGCGTGCAAGCCGGGCAGGCGGTCCACCTAGACCCACAGCCCCTCGGGAGCAC 
TGTAGCTTTCTCTCCCTCAGCCTGCAAGGGTCCGATTTGCCATCGAAAARGACAACCTCT 
TTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCT 
TTGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTAATC 

Human Arfl niRNA sequence - varl2 (public gi: 20147654) (SEQ ID NO: 336) 

ATGGGGAACATCTTCGCGAACCTCTTCAAGG 

GCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTGA 

CACCATAGGCTTG^CGTGGAAACCGTGGAGTACAAGAACATCA 

CAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCC^ 

GGAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGA 

GGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAACGCC^ 

GACAAGCTGGGGCTGCACTCACTACGCC^C^^ 

ACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCX^AACCAGAAGTGA 

Human Arfl mRNA sequence - varl3 (public gi: 178163) (SEQ ID NO: 337) 

AAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCC 
GCATGGGGAACATCTTCGCCAACCTCTTCAAGGGCCIT^ 

GGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTGACCACCATT 
CCCACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATCAGCTTCACTGTGTGGGACGTGGGTG 
GCCAGGACAAGATCCGGCCCCTGTGGCGCCA.CTACTTCCAGAACACACAAGGCCTGATCTTCGTGGTGGA 
CAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCATGAGGATGCTGGCCGAGGACGAGCTC 
CGGGATGCTGTCCTCCTGGTGTTCGCCAACAAG 

CAGACAAGCTGGGGCTGCACTCAOTACGCCACAGGAACTGGTACATTC^ 

CGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATC^ 

TCCCTCTCACTCCTCTTGCCCTCT^ 

GAAGCTGCCTCCGTGGTTTGGTCACCGTGTGCATCGCACCGTGCTGTAAATGTGGCAGACGCAGCCTGCG 
GCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTT^ 
TTACTCAGCTTTTTTTATTGTAAAAAGAAAAATCAACTCACTGTTC^ 
ATGGGCACCTGGCCTCCAGGAGTCGCTGTGTTGGGAG 

GAAATCCATTTTGGTGGTTGGTTTTTAACCCAAACTCAGTGCATTTTTTAAAATAGTTAA 
CGAGAACACTTGAACACACAGAAGGGAGACCCCGCCTAGCATAGATT^ 

TCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCAGCAGACTGCGATGRATTCTG^ 
TGGTCACAGTAGAGATCCCCGCAACTCGCTTGTCCTTGGGTCACCCTGCATTCCATAGCCATGTGCTTGT 
CCCTGTGCTCCCACGGTTCCCAGGGGCCAGGCTGGGAGCCCACAGCCACCCCACTATGCCGCAGGCCGCC 
CTACCGACCTTCAGGCAGCCTATGGGACGCAGGGCCCCATCTC 

GGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGTCTGGGGCCTCACCAGCAGG 

GCGCGTGCAAGCCXKKSGkGGCGGTCCACCTAGAC^ 

ATGTAGCTTTCTCTCCCTCAGCCTGCT^GGGTCCGATT^ 

TTTTGTATTTTGATAAACACTGAAGCTGGAGCTGTTAAATTTATCTTGGGGAAACCTCAGAACTGGTCTA 

TTTGGTGTCGTGGAACCTCTTACTGCTTTCAATACACGATTAGTAATCAACTGTTTTGTATACT 

CAGTTTTC^TTTCGACAAACAAGC^CTGTAATTATAGCTATTAGAATAAAATCTCTTAACTATTT 

Human Arfl niRNA sequence - varl4 (public gi: 178982) (SEQ ID NO: 338) 

GGGGAAAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTGTCC 
CACAAGCATGGGGAACATCrTTCGCCAACCTCTTCAAGGGC^ 

ATGGTGGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAGCTTAAGCTGGGTGAGATCGTGACCA 
CCATTCCCACCATAGGCITCAACGTGGAAACCGTGGAGTAC^ 

GGGTGGCCAGGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAACACACAAGGCCTGATCTTCGTG 
GTGGAC^GC^TGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCA.TGAGGATGCTGGCCGAGGACG 
AGCTCCGGGATGCTGTCCTCCTGGTGTTCGCCAACAAGCAGGACCTCCCCAACGCCATGAATGCGGCCGA 
GATCACAGACAAGCTGGGGCTGCACTCACTACGCCACAGGi^ 

AGCGGCGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATCAGCTCCGGAACCAGAAGTGAACGCGAC 
CCCCCTCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCTCATGTGGCAAACGTGCGGCTCGTGGTGTGAG 
TGCCAGAAGCTGCCTCCGTGGTTTGGTCACCGTGTGC^ 

CTGCGGCCAGGCTTTTTATTTAATGTAAATAGTTTTTGTTTCCAATGAGGCAGTTTCTGGTACTCCTATG 
CAATATTACTCAGCTTTTTTTATTGTAAAAAGA^ 
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GGCCCATGGGCACCTCGCCTC 

TGTTGAAATCCATTTTGGTGGTTGGTTTT^ 

AGTCGAGAACACTTGAACACACAGAAGGGAGACCC^ 

CAGTCGCCAGCCCAGCTGTTCCCCTCGGGAACATGAGGTGGTGGTGGCGCA 
GCATGGTCACAGTAGAGATCCCCGCAACra^CT^ 
TGTCCCTGTGCTCCCACGGTTCCCAGGGGCCAGGCTGGG^ 
GCCCTACCCACCTTCAGGGAGCCTATGGGACGCAC^ 

TGGTCCGTCGTCCCCAACACTCGTGCTCGCTCAGACACTTTGGCAGGATGT 

AGCGCGTGCAAGCCGGGCAGGCGGTCCACCTAGACCCACAGCCCCTCGGGAGC^ 

GATGTAGCTTTCTCTCCOTCAGCCTGCAAGGGTCCGATT^ 

CTTTTGTATTTTGATAAACACTGAAGCTGGAGCTC 

ATTTGGTGTCGTAGGAACCTCTTACTGCTTTCAAT^ 

TTCAGTTTTCATTTCGACAAACAAGCACTC 

Human Arfl mRNA sequence - varlS (public gi: 3005720) (SEQ ID NO: 339) 

AAACCAACGCCTGGCTCGGAGCAGCAGCCTCTGAGGTC 

GGATGGGGAACATCTTCGCCAACCTCTTCAAGGGCC 

GGGCCTGGATGCTGCAGGGAAGACCACGATCCTCTACAAG 

CCGACCATAGGCTTCAACGTGGAAACCGTGGAGTACAAGAACATCA.GCTTCACTGTC 
GCC^GGACAAGATCCGGCCCCTGTGGCGCCACTACTTCCAGAAC 
CAGCAATGACAGAGAGCGTGTGAACGAGGCCCGTGAGGAGCTCAT^ 
CGGGATGCTGTCCTCCTGGTGTTCGCCAACAAG 

CAGACAAGCTGG<^CTGCACTCACTACGCCACAGGAACTGGTACATTC^ 

CGACGGGCTCTATGAAGGACTGGACTGGCTGTCCAATC^GCTCCGGAACCACVVAGTC 

TCCCTCTCACTCCTCTTGCCCTCTGCTTTACTCT 

gaagctgcctccgtggtttggtcaccgtgtgcatcgc^ccgtgctgtaaatgtggcagacgcacctgcgg 

ccaggctttttatttaatgtaaatagtttttgtttcc^ 

tactcagctttttttattgtaaaaagaaaaatcaactc^ 

tgggcacctogcctccaggagtcgctgtgttgggagagccggccacgccct^ 

aaatccattttggtggttggtttttaaccgaaactcagtgcattttttaaaatag 

gagaacacttgaacacacagaagggagaccccgcctagcatagatttgcagttacggcctgga 

cgcctvgcccagct'gttcccctcgggaagatgaggtggt^^ 

ggtcacagtagagatccccgcaactcxkittgtccttggtc^ccctgcattccatagccatgtgct 

crgtgctcccacggttcccaggggccaggctgggagcccacagccacccgactatgccgc^ 

acccaccttcaggcagcctatgggacgcagggccccatctgtccctcggtcgccgtgtggccagagtggg 

tccgtcgtccccaacactcgtgctcgctcagacactttggcaggatgtctggggcctcaccagcaggagc 

gcgtgcaagccgggcaggcggtccacctagacccacagcccct 

gtagctttctctccctcagcctgcaagggtccgatttgccatcgaaaaagacaacctctactot 

ttgtattttgataaacactgaagctggagctgttaaatttatcttggggaaacctcagaactggtctatt 

tggtgtcgtggaacctcttactgcttt 

AAA 

Human Arfl protein sequence - var 1 (public gi: 3 3 604 9 1) (SEQ ID NO: 223) 

MGNI FANLFKGLFGKK^MRlIiMVGLDAAGKTTI LYKLKLGE I VTT I PTIGFNVE TVEYKNIS FTVWDVGG 
QDKI RPLWRHYFQNTQGIi I FVVDSNDRERVNE AREELMRMLAEDELRDAVL^ I T 

DKLGLHSLRHRNW YI QATCATSGDGL YEGLDWLSNQLRNQK 



Figure 36 part - 7 



52/202 



WO 2004/078130 



PCT/US2004/006308 



Unigeae Name; ARF5 Unigene" ID: Hs. 430657 

Human ARF5 mRNA sequence - varl (public gi: 178986) (SEQ ID NO: 340) 

CCAGTTCCAGCCCGCACCCCGCGTCGGTGC^ 

CCGCGCTCTTTTCGCGGATCTTCGGGAAGAAGCAGATGCC^ 

CAAGACCAG^TCCreTACAAACTGAAGTTG 

GTAGAAAC^GTGGAATATAAGAACATCTGTTTCACAGTCT 

CTCTGTGGCGGCACTACTTCCAGAACACTCAGGGC 

GGTCCAAGAATCTGCTGATGAACTCCAGAAGATGCTGCAGG 

GTATTTGCCAACAAGCAGGACATGCCCAACGCCATC 

AGCACTTACGCAGCCGCACGTGGTATGTCCAGGCCACCT 

TCTGGACTGGCTGTCCCACGAGCTGTCAAAGCGCTAACCAGCCAGGGGCAGGCCCCTGAT 

TCCTGCGTGCATCCCCGGGATGACCAGACTCCCGGACTCCT 

TCCCCCATAGCCACAGGCCTCTGCTCCTGCT 

TTGCTCTCTGGGCACAGAGGGGTCCACTCTCCTGCCTGCTGGGACCTATG 
GCCCCCTCTTCCAGAGGAGGAGCAGGGATCTGGGTTTCCTTTTTTT 
GGCCAGGTTGGGAGGGGGAAGGTGAGGGCTTCGGGTGGTGCT 
AATTTGCTGTGGTTTG 

Human ARF5 mRNA sequence - var2 (public gi: 21620017) (SEQ ID NO: 341) 

CTCCTCCTGCTGCTGCTGCGCCCCATCCCCCCX3CGGCCGGCCAGTTCCAGCCCGCACCCC 

CCGCK3CCCCTCCCCGGGCTCCGCCATGGGCCT 

AGCAGATGCGGATTCTCATGGTTGGCTTGGATGCGGCTGGCA^ 

GGGGGAGATTGTCACCACCATCCCAACCATAGGCTTC^^ 

TTCACAGTCTGGGACGTGGGAGGCCAGGACAAGATTCGGCCTC^ 

AGGGCCTCATCTTTGTGGTGGAGAGTAATGACCGGGAGCGGGTCCAAGAATCTGCTGATG 

GATGCTGCAGGAGGACGAGCTGCGGGATGCAGTGCTGCTGGTATTT^ 

GCCATGCCCGTGAGCGAGCTGACTGACAAGCTGGGGCTAC^^ 

AGGCCACCTGTGCCACCCAAGGCACAGGTCTGTACGATGGTCTGGACTGGCTGTCCCA 

GCGCTAACCAGCCAGGGGCA.GGCCCCTGATGCCCGGAAGCTCCTGCGTGCATCCCCGGGATGACCAGACT 

CCCGGACTCCTCAGGC^GTGCCCrTTCCTCCCA^ 

TCCTGCCTGCATGTTCTCTCTGTTGTTGGAGCCTGGAGCCTTGCTCTCTGGGCACAGAGGGGT^ 

CCTGCCTGCTGGGACCTATGGAAGGGGCTTCCTGGCCIAAGGCCCCCTCTTCCAGAGGAGGAGCAGGGATC 

TGGGTTTCCTTTTTTTTTTCTGTTTTGGGTGTACTCTAGGGGCCAGGTTGGGAGGGGGAAGGTGAGGGCT 

TCGGGTGGTGCTATAATGTGGCACTGGATCTTGAGTAATAAATTTGCTGTGGTTTGTAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAA 

Human ARF5 mRNA sequence - var3 (public gi: 12 8 04 3 64) (SEQ ID NO: 342) 

CCCGCGTCGGTGCCCGCGCCCCTCCCCGGGCCCCGCC^TGGGCCTCACCGTGTCCGCGCTCTTTTCGCGG 
ATCTTCGGGAAGAAGCAGATGCGGATTCTCATGGTTGGCTTGGATGCGGCTGGCAAGACCACAATCCTGT 
ACAAACTGAAGTTGGGGGAGATTGTCACCACCATCCC^ 

TAAGAACATCTGTTTCACAGTCTGGGACGTGGGAGGCCAGGACAAGATTCGGCCTCTG^ 

TTCCAGAACACTCAGGGCCTCATCTTTGTGGTGGACAGTAATGACCGGGAGCGGGTCCAAGAATCT 

ATGAACTCCAGAAGLATGCTGCAGGAGGACGAGCTGCGGGATC 

GGACATGCCC^CGCCATGCCCGTGAGCGAGCTGACTGACAAGCTGGGGCTACAGCACTTACGCAGCCGC 
ACGTGGTATGTCC7VGGCCACCTGTGCCACCCAAGGa\CAGGTCTGTACGATGGTCTGGACTGGCTGTCCC 

acgagctgtc^aagk:gctaaccagcc^ggggcaggcccctgatgcccggaagctcct 
gatgacc^tactcccggactcct^ggc^^ 

ctctgctcctgctcctgcctgcatgttctctctgttgttggagcctggagccttgctctctgggcacaga 
ggggtccactctcctgcctgctgggacctat^ 

GGAGCAGGGATCTGGGTTTCCTTTTTTTTTTCTGTTTTGGGTGTACTCTAGGGGCCAGGTTGGGAGGGGG 
AAGGTGAGGGCTTCGGGTGGTGCTATAATGTGGCACTGGATCTTGAGTAATAAATTTGCTGTGGTTTGAA 
AAAAAAAAAAAAAAAAAAA 

Human ARF5 mRNA sequence - var4 (public gi: 30583012) (SEQ ID NO:343) 

ATGGGCCTCACCGTGTCCGCGCTCTTTTCGCGGATCTTCGGGAAGAAGCAGATGCGGATTCTCATGGTTG 
GCTTGGATGCGGC^GGCAAGACGACAATCCTGTACAAACTGA 

AACCATAGGCTTCAATGTAGAAACAGTGGAATATAAGAACATCTGTTTCACAGTCTGGGACGTGGGAGGC 
CAGGACAAGATTCGGCCTCTGTGGCGGCACTACTTCCAGAACACTCAGGGCCTCATCTOT 
GTAATGACCGGGAGCGGGTCCAAGAATCTGCTGATGAACTCCAGAAGATGCTGGAGGAGGACGAGCTGCG 
GGATGCAGTGCTGCTGGTATTTGCCAACAAG<^GGAC^TGCCCAA 
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GACAA.GCTGGGGCTACAGCACT 
CAGK5TCTCTACGATGGTCTGGACTGGCTC 

Human ARF5 mRNA sequence - var5 (public gi: 6995999) (SEQ ID NO: 344) 

CCGCGTtt^TGCCCGCGCCCCTCCCCGGGCCC 

TCTCCGGGAAGAAGCAGATGCGGATTCT 

CAAACTGAAGTTGGGGGAGATTGTCACCAC 

AAGAACATCTGTTTCACAGTCTGGGACGTGGGAGGCCAGGACAAGATTCGGCCT 
TCCAGAACACTCAGGGCCTCATCTTTGTGGTGGAC^ 
TGAACTCCAGAAGATGCTGCAGGAGGACX3AGCTC 
GACATGCCCAACXSCCATGCCCGTGAGCGAGCTGACTGAC^ 

CGTGGTATGTCCAGGCCACCTGTGCCACCCAAGGCA.CAGGTCTGTACGATGGTCTC 
CGAGCTGTCAAAGCGCTAACCAGCCAGGGGCAGGCCCCTGATGCCCGGAA^ 
GATGACCAGACTCCCGGACTCCTCAGGCAGTGCCC 
CTCTGCTCCTGCTCCTGCCTGCATGTTCTCT 

GGGGTCCACTCTCCTGCCTGCTGGGACCTATGGAAGGGGCTTCCTGGCCAAGGCCCCCrOT 

GGAGGAGGGATCTGGGTTTCCTTTTTTTTT^ 

AAGGTGAGGGCTTCGGGTGGTGCTATAATGTGGCACTGGATCT 

Human ARF 5 protein sequence - var 1 (public gi:305830i3) (SEQ ID NO: 224) 

MGDTVSALFSRI FGKKQMRILMVGIiDAAGKTTILYKLKIjGEIVTTI PTIGFNVETVEYKNICFTVWDVGG 
QDKI RPLWRHYFQNTQGLI FVVDSNDRERVQESADELQKMLQEDEIjRDAVLLVFANKQDMPNAMPVSELT 
DKXjGIjQHIjRSRTWYVQATCATQGTGLYDGIjDWLSHELSKR 



Uhigene Name: ATP6V0C Unigene ID: Hs . 389107 

Human ATP6V0C raRNA sequence - varl (public gi: 33 8743 73) (SEQ ID NO: 345) 

GGTATTTAGAGCGCAGCGGCTGACGGGCCGGATCGCCTTCGCCGCCGCCCGCCCGC^VAACCTTCGTGCCC 
GGCCCGTCCTCGCCCCCGCCTC03CCAC^ 

TGTCCGAGTCCAAGAGCGGCCCCGAGTATGCTTCGTTTTTCGCCGTCATGGGCGCCTCGGCCGCCATGGT 
CTTCAGCGCCCTGGGCGCTGCCTATGGCACAGCCAAGAGCGGTACCGGCAT^ 

CGGCCGGAGCAGATCATGAAGTCCATCATCCCAGTGGTCATGGCTGGCATCATCGCCATCTACGGCCT^ 

TGGTGGCAGTCCTCATCGCCAACTCCCTGAATGACGACATCAGCCTCTACAAGAGCTTCCTCCAGCTGGG 

CGCCGGCCTGAGCGTGGGCCTGAGCGGCCTGGCAGCCGGCTTT^ 

GTGCGGGGCACCGCCCAGCAGCCCCGACTATTCGTGGGCAT^ 

TCGGCCTCTACGGTCTCATCGTCGCCCTGATCCT 

CAGAAjTATTATGTAAAGACCACCCCTCCTCATTCCAGAACGAACAGCCTGACACATACGCACGGGGCCGC 

CX3CCCCCAGTAGTTGGTCTTGTACATGCGCAGTGTCCTAGTGCCCATCGTCTGTTTCCCC 

CCGCCCGCCCCGTGCCGTGGACATCTGGGCCCACTCATCGCCCCTCCAGGCCCCCGGCGCCCCACCCCeT 

AGAGTGCTCTGTGTATGCGGATGATTTAGAATTGTCATTTCTCTTTACTGGATGTTTATTTATAAAGATC 

TGGCCTGTTCCTGCGTCTGCGGAGCGGCCCTTGTCTCCCAGCTATCTATAACCTTAGCTAGAGTGTCGCC 

TTGTGGGTTCCTGTTGCTGAGACTTCCTGGATGGAGCCGCCCTCACCGCCGGGCCCGTGGCCCTGCGCGG 

AGCTGTGTCCZAATAAAGTTCTTGGATGTGAAAAAAAAAAAAAAAAAAAAAAA 

Human ATP6V0C mRNA sequence - var2 (public gi: 33 872390) (SEQ ID NO: 346) 

GGCTGACGGGCCGGATCGCCTTCGCCGCCGCCCGCCCGCAAACCTTCGTGCCCGGCCCGTCCTCGCCCCC 
GCCTCCGCCACCGCCTCGGCCCGCAGAGCTTGCCCCCTCCCCACCCGC^GAaVTGTCCGAGTCCAAGAGC 
GGCCCCGAGTATGCTTCGTTTTTCGCCGTCATGGGCGCCTCGGCCGCCATGGTCTTCAGCGCCOTGGGCG 
CTGCCTATGGCACAGCCAAGAGCGGTACCGGCATTGCGC^ 

GAAGTCCATCATCCCAGTGGTCATGGCTGGCATCATCGCCATCTACGGCCTGGTGGTGGCAGTCCTCATC 
GCCAACTCCCTGAATGACGACATCAGCCTCTACAAGAGCTTCCT 

GCCTGAGCGGCCTGGCAGCCGGCTTTGCC^TCGGC^TCGTGGGGGACGCTGGCGTGCGGGGC^CCGCCCA 
GCAGCCCCGACTATTCGTGGGCATGATCCTGATTCTCATCTTCGCCGAGGTGCTCGGCCTCTACGGTCTC 
ATCGTCGCCCTCATCCTCTCCACAAAGTAGACCCT^^ 

ACCACCCCTCCTCATTCCAGAACGAACAGCCTGACACATACGCACGGGGCCGCCGCCCCCAGTAGTTGGT 
CTTGTACATGCGCAGTGTCCTAGTGCCCIATCGTCTGTTTCCCCGGCCTTGCCCCCGCCCGCCCCGTGCCG 
TGGACATCTGGGCCCACTCATCGCCCCTCCAGGCCCCCGGCGCCCCACCCCCTAGAGTGCTCTGTGTATG 
CGGATGATTTAGAATTGTCATTTCTCTTTACTGGATGTTTATTTATAAAGATCTGGC 
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TGCGGAGCGGCCCTTGTCTCCCAGCTATCTATAACCTTAGCT 
TGAGACTTCCTGGATGGAGCCGCCCTCACC^ 

"TTCTTGGATGTGAAAAA7VAAAAAAAAAAAATAAAAAAAA7^AAAAAAA 
AAAAAAAAAAAAAA 

Human ATP6V0C mRNA sequence - var3 (public gi: 33873673) (SEQ ID NO: 347) 

CGCCTTCGCCGCCGCCCGCCCGC^ 

CGGCCCGCAGAGCTTGCCCCCTCCCCACCCGCAGAC^^ 

CGTTTTTCGCCGTGATGGGCGCCTCGGCCGCCATGGTC 

CAAGAGCX3GTACCGGCATTGCX3GCCATGTCTGTCATGCGGCCGGAGCAGATCATGAAGTC 
GTGGTCATGGCTGGCATCATCGCCATCTACGGCCTGGTGGTGGCAGTCCTCATCGCCAACTCCOT 

ACGACATCAGCCTCTACAAGAGCTTCCTCCAGCTGGGOT 
AGCCGGCTTTGCCATCGGCATCGTGGGGGACGCTGGro 

GTGGGCATGATCCTGATTCTCATCTTCGCCGAGGTGCTCGGCCTCTACGGTCTCATCGTCGCCCTCATCC 

TCTCCACAAAGTAGACCCTCTCCGAGCCCACCAGCCACAGAATATTATGTAAAGACCACCCCT 

CCAGAACGAACAGCCTGACACATACGCACGGGGCCGC^ 

GTCCTAGTGCCCATCGTCTGTTTCCCCGGCCTTGCCCCCGCCCGCCCCGTGCCGTGGACATCTGGGCCCA 
CTCATCGCCCCTCCAGGCCCCCGGCGCCCGACCCCCTAGAGTGCTCTGTGTATGCGGATGATTTAGAATT 
GTCATTTCTCTTTACTGGATGTTTATTTATAAAGATCTGGCCTGTTCCTGCGTCTGCGGAGCGGCCCTTG 
TCTCCCAGCTATCTATAACCTTAGCTAGAGTGTCGCCTTGTGGGTTCCTGTTGCTGAGACTTCCTGGATG 
GAGCCGCCCTCACraCCGGGCCCGTGGCCCTGCGCGGAGCTGTGTCCAATAAAGTTCTTGGATGTGAAAA 

AAAAAAAAAAAAAAAAAAA 

Human ATP6V0C mRNA sequence - var4 (publicgi: 33990932) (SEQ ro NO: 348) 

GACGGGCCGGATCGCCTTCGCCGCCGCCCGCCCGCAAACCTTCGTGCCCGGCCCGTCCTC^ 

CCGCCACCGCCraSGCCCGCAGAGCrTGCCCCCTCCCCACCCGCAGACATGTCCGAGTCCAAG^^ 

CCGAGTATGCTTCGTTTTTCGCCGTCATGGGCX3CCT 

CTATGGCACAGCCAAGAGCGGTACCGGCATTGCGGCCATGTCTGTCATGCGGCCGGAGCAGATCATGAAG 

TCCATC^TCCCAGTGGTCATGGCTGGCATCATCGCCATCTACGGCCTGGTGGTGGCAGTCCTCATCGCCA 

ACTCCCTGAATGACGACATCAGCCTCTACAAGAGCTTCCTCGAGCTGGGCGCCGGCCTGAGCGTGGGCCT 

GAGCGGCCTGGCAGCCGGCTTTGCCATCGGCATCGTGGGGGACGCTGGCGTGCGGGGCACCGCCCAGCAG 

CCCCGACTATTCGTGGGCATGATCCTGATTCTCATCTTCGCCGAGGTGCTCGGCCTCTACGGTCTCATCG 

TCGCCCTCATCCTCTCCACAAAGTAGACCCTCTCCGAGCCCACCAGCCACAGAATATTATGTAAAGA 

CCCCTCCTC^TTCC^^CGAAC^GC^^ 

TACATGCGCAGTGTCCTAGTGCCCATCGTCTGTTTCCCCGGCCTTGCCCCCGCCCGCCCCGTGCCGTGGA 
C^TCTGGGCCCACTCATCGCCCCTCCAGGCCCCCGGCGCCCCACCCCCTAGAGTGCTCTGTGTATGCGGA 
TGATTTAGAATTGTCATTTCTCTTTACTGGATGTTTATTTATAAAGATCTGGCCTGTTCCTGCGTCTGCG 
GAGCGGCCCTTGTCTCCCAGCTATCTATAACCTTAGCTAGAGTGTCGCCTTGTGGGTTCCTGTTGCTGAG 
ACTTCCTGGATGGAGCCGCCCTCACCGCCGGGCCCGTGGCCCTGCGCGGAGCTGTGTCCAATA7^AGTTCT 
TGGATGTGAAAAAAAAAAAAAAAAAAA 

Human ATP6V0C mRN A sequence - var5 (public gi: 1991343 6) (SEQ ID NO: 349) 

GTTCTGCGGTGCTGGTATTTAGAGCGCAGCGGCTGACGGGCCGGATCGCCTTCGCCGCCGCCCGCCCGCA 

AACCTTCGTGCCCGGCCCGTCCTCGCCCCCGCCTCCGCCACCGCCTCGGCCCGCAGAGCTTGCCCCCTCC 

CCACCCGCAGACATGTCCGAGTCCAAGAGCGGCCCCGAGTATGCTTCGTTTTTCGCCGTCATGGGCGCCT 

CGGCCGCCATGGTCTTCAGCGCCCTGGGCGCTGCCTATGGCACAGCCAAGAGCGGTACCGGCATTGCGGC 

CATGTCTGTCATGCGGCCGGAGCAGATCATGAAGTCCATCATCCCAGTC 

ATCTACGGCCTGGTGK3TGGGAGTCCTGATCGC 

TCCTCCAGCTGGGCGCCGGCCTGAGCGTGGGCCTGAGCGGCCTGGCAGCCGGCTTTGCCATCGGCATCGT 
GGGGGACGCTGGCGTGCGGGGCACCGCCCAGCAGCCCCGACTATTCGTGGGCATGATCCTGATTCTCATC 
TTCGCCGAGGTGCTCGGCCTCTACGGTCTCATCGTCGCCCTCATCCTCTCCACAAAGTAGACCCTCTCCX3 
AGCCCACGAGCCACAGAATATTATGTAAAGACCACCC^ 

CGCACGGGGCCGCCGCCCCCAGTAGTTGGTCTTGTACATGCGCAGTGTCCTAGTGCCCATCGTCTGTTTC 
CCCGGCCTTGCCCCCGCCCGCCCCGTGCCGTGGACATCTGGGCCCACTCATCGCCCCTCCAGGCCCCCGG 
CGCCCCACCCCCTAGAGTGCTCTGTGTATGCGGATGATTTAGAATTGTCATTTCTCTTTACTGGATGTTT 
ATTTATAAAGATCTGGCCTGTTCCTGCGTCTGCGGAGCGGCCCTTGTCTCCCAGCTATCTATAACCTTAG 
CTAGAGTGTCGCCTTGTGGGTTCCTGTTGCTGAGACI^ 

TGGCCCTGCGCGGAGCTGTGTCCAATAAAGTTCTTGGATGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAA 

Human ATP6V0C mRNA sequence - var6 (public gi: 3453444 7) (SEQ ID NO: 350) 
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TTTATGCTTGTGTTTCTGC^CT 
TCACAGGTCTGCCTTCTGCAGTCTGK3TCAGGCT 

CCGCTCCTGGTGGCCATCCCTTTCTTCCCTGTGCTCCTCTTGGTGCCTCCTCCCCCT 

TTCTTCCTGCTGCCCTCTGTAGAAAAGGGCCTGGCTCACTTCC^ 

GGGTCCACTACTTGGGTTGCTGAGTTCCCTGTATTCAGT^ 

TCTTGTGCATACATGATGCAJGTTGGATGTGGTCCTGGGCCTG 

ATTGCTCTATATGCTTGCCAGGGAAAAAATGCACTGTA 

CCTGGGCCCCAGTCTGCAGGTGCACTGGGTGTTGGCATGGCATGTCTG 

GGAGGCCGTGTGGOTCCCTGGCCCAGGTCTCAGCCTCCTTCCTCCCTCTATAGTGA 

AGCACCGTCGTCTTGGGTGCCTCTGCAGGTGCTATCCAGAGCCCTTGTCTTATTGCCT 

ACTCCTCTCTCCCGCCAACTTGGGAT^^ 

CTGGAGCATGTCTCTGTGCCIX3GAGGTC^ 

GGTAGGCAAGTTTTGTGGCCCAAATATATCAATAAAATATGAAG 

CTTCCACCTCTGACATATGTAGTCTTCTGCAGGTCAGGCTGTTTGTG 

GTGTGTGTGTGTCTGTCAGAGATTCACTCTTGTTGTTTGT^ 

CTGGAGTGCAGTGGCGTGATCTTGACTCACTGCAACCTCCACTCCTGGGTTCAGGCGATTCTCCTGCCT 

AGCCTCCCTAGTAACTGGGATGACAGGGATGCGCCACCACTCCTGGCT 

ACGAGGTTTCACCATGTTACCCAGGCTAATCTCGAACTTC 

GCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAGAAAGTTAGCCAGGT 

TGCCTGTAATCCCACTTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGCAGAGGTTA 

GTGAGCCGAGATCGCGCCACTGCACTCCACCCT^^ 

AAAAAATTTTTCATTTGAGGTATTCTTCCAGTAGAAGGTTAGTAAGTTTTTAATGA 

ACACTTCCCAGAAAATAGATGACATCAGTGCCCCTTGCTACTTTCT 

CCCAGGTACTGAAACTGGTTGTCTTGAGTTTTGTGTCAGCTTTTTCTCCA 

•GCTTCTGAAGCAGTCTAGGTTAAACTAGCCAGGCAGGTAGTTC 

TTTAGAGATCAGGCCACAGCTTTTTATATCGCACAGGAC^ 

TTCCCCAGGAACCTCACTCCTTTGGTAGAACCTTGGGATTTTAGAAATTGT 

TACTCCAACAGTTGAAGTTACAGACATTGCTCCCAAATOT 

TCCCCATCTGGCCTTTACCTTCTTTGCTTC^GTGGTTGAAAACAGTTGCCATATTGAAAGTAT 

TTCAACCTCACACAAATGACAAGTCCCATTTTACAATCCT^ 

CAGGGCGGCTGCAGTTGGAG<SCCGAGGGGAGCCCTCTX3^ 

CCCGCAGTGCTGAACATGCCCCACCGCCCAGGCCCAGCACTGCTTGTTGGGTCAG 

Human ATP6V0C mRNA sequence - var7 (public gi: 3 o 5 8 3 14 8) (SEQ ID NO: 351) 

ATGTCCGAGTCCAAGAGCGGCCCCGAGTATGCT^ 

TCTTCAGCGCCCTGGGCGCTGCCTATGGCAGAGCCAAGAGCGGTACCGGCATTGCGGCCATGTCTGTCAT 

GCGGCCXSGAGCAGATCATGAAGTCCATCATCCCAGTGGT^^ 

GTGGTGGCAGTCCTCATCGCCAACrCCCTGAATGACG 

GCGCCGGCCTGAGCGTGGGCCTGAGCGGCCTGGCAGCCGGCTTTGCCATCGGCATCGTGGGGGACGCTGG 
CGTGCGGGGCACCGCCCAGCAGCCCCGACTATTCGTGGGCATGATCCTGATTCTCATCTTCGCCGAGGTG 
CTCGGCCTCTAGGGTCTCATCGTCGCCCTCATCCTCTCCACAAAGTAG 

Human ATP6V0C protein sequence - varl (public gi: 3 0583 149) (SEQ ED NO: 225) 

•MSESKSGPEYASFFAVMGASAA^WFSAI/3AAyGTAKSGTGIAAMSVMRPEQIMKS 1 1 PWMAGI IAIYGIi 
WAVLI ANSLNDDI SLYKS FLQLGAGLSVGLSGIjAAGFAI GI VGDAGVRGTAQQPRXjFVGMILI LI FAEV 
LGLYGLIVALILSTK 

Human ATP6V0C protein sequence - var2 (public gi: 34534448) (SEQ ID NO: 226) 

MI LPAALCRKGPGS LPASGGLLASQGPLIiGLIiS SLYSVS CQRVCHALVS CAYMMQLDWLGLQWEP PKMH 

CNCSICLPGKKCTVTRSSGQALGQALGPSLQVHWVLAWHVWAPPGWRGGGRVAP 

TQHRRLGCLCRCYPEPIxSYCIjVFL 

Human ATP6V0C pray sequence - varl (SEQ ID NO: 352) 

ccgccatggagtacccatacgacgtaccagattacgctcatatggccatggaggccagtgaattccaccc 

aagcagtggtatcaacgcagagtggccatttggggggtctgcggtgctggtatttagagcgcagcggctg 

acgggccggatcgccttcgccgccgcccgcccgcaaaccttcgtgcccggcccgtcctcgcccccgcctc- 

cgccaccgcctcggcccgcagagcttgccccctcccccccatgtcggccgcctcggcctctagagggtgg 

gcatcgatacgggatcc^tcgagctcgagctgc^gatgaatcgtagatactgaaaaaccccgcaagttca 

cttcaactgtgcattcgtgca 

Unigene Name : CBLB Unigene ID : Hs.3144 Clone ID : 3GDJL14 
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Human CBL-B mRNA sequence - varl (public gi: 4757919) (SEQ ID NO: 353) 

CTGGGTCCTCTGTGTGCCACAGGGGTGGGGTGTC 

GGCGTCCCGCGGCCTCCCCGAGTQSGGCXKjGAGGGGAGAGCGGGTGTGG 

TGCGTTTCCACGTCTCGGAGGCCTGCGCGCT 

CCCACTCCCAGCCGGGGCTCCCCACACACACTGGGCTGCGTG 

TGTCTCTGGACAGCTACGGCGCCGAAAGAACTAAAAT^ 

CTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTT^ 

TGGACCCCCTAAGCAAGCTGCCGCAGATCGCAGGACCGTGGAG 

GTAAGACTGTGCCAAAATCCCAAACTTGAGTTGAAAAATAGCCCAC 

ATACATATCAGCATTTACGACTTATATTGAGTAAAT^ 

<3AATGAGTACTTTAAAATCTACATTGATAGC 

GAAGGCAAGGAGAGAATGTATGAAGAACAGTGACAGGACAGAC^ 

TCAGTCACATGCTGGCAGAAATCAAAGCAATCTTTC^ 

GACAAAAGCAGATGCTGCTGAATTCTGGAGAAAGTTTTTTGGAG 

TTCAGACAGTGCCTTCATGAGGTCCACCAGATTAGCTCTAGCCTGGAAGC^ 

TTGATTTAACTTGCAATGATTACATTTCA^ 

GGGCTCTATTTTGCGGAATTGGAATTTCTTAGCTGTGACACATCCAGGTTACATGGCATTTCTCA^ 

GATGAAGTTAAAGCACGACTACAGAAATATAGCACCAAACCCGGAAGCTATATTTTCCGGTT^ 

CTCGATTGGGACAGTGGGCCATTGGCTATGTGACTGGGGATGGGAATATCTTACAGACCA 

CAAGCCCTTATTTCAAGCCCTGATTGATGGCAGCAGGGAAGGATTTTATCT^ 

TATAATCCTGATTTAACTGGATTATGTGAACCTACACCT 

ATGAATTATATTGTGAAATGGGCTCCACTTTTCAGCTCTGTAAGATTTGTGCZAGAGAATGAC 
CAAGATTGAGCCTTGTGGGGATTTGATGTGGACCT 

GGCTGCCCTTTCTGTCGTTGTGAAATAAAAGGAACTGAGCCCATAATCGTGGACCCCTTTGATCCAAGAG 

ATGAAGGCTCCAGGTGTTGCAGCATCATTGACCCCTTT 

TGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCAAA 

CCAGTCACATCACCAGGATCCTCTCCCCTTGCCCAGAGAAGAAAGCCACAGCCTG 

CACATCTAAGCCTGCCACCCGTGCCTCCTCGCCTGGATCTAATTCAGAAAGGCATAGTTAGATCTCCOT^ 
TGGCAGCCCAACAGGTTCACCAAAGTCTTCTCCTTGCAT^ 
CC^CCTCCTCCCTTAAGAGATCCTCCTCCACC^ 
GAOTGAGTAGACACATCCATGATGTGGAAAGCGTGCCTTCCAGA^ 

GTGCCCTCGGGATGTGTTTGGGACTAATCAGCTTGTGGGATGTCGACTCCTAGGGGAGGGCTCTCCAAAA 
CCTGGAATCA(^GCGAGTTCAAATGTGAATGGAAGGCACAGT^ 

GGAAAGA.CAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGGTCTTTTCCAATGGTCACCTTGGAAGTGA 
AGAATATGATGTTCCTCCCCGGCTTTCTCCTCOTCCTCC^GTTACCACCCTCCTCCCTAGCATAAAGTGT 
ACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAACAAGAGACCCAGTAGAGGAAGATGATGAT 
AGATTCCTTCATCCCACCCTGTTTCCCTGAAT^ 

TCGGTCCTGTGATAATGGTCACTGTATGCTGAATGGAACACATGGTCCATCTTCAGAGAAGAAATCAAAC 
ATCCCTGACTTAAGCATATATTTAAAGGGTACGTATAGAATATAATTTCCTTTGTGATGTACATCTTAAT 
GGTCAGAATTTAAAGGCAAAATTTCATGCCATTGTACTGAAAA 

GGAGATGTTTTTGATTCAGCCTCTGATCCCGTGCCATTACCACCTGCCAGGCCTCCAACTCGGGACAATC 

CAAAGCATGGTTCTTCACTC^CAGGACGCCCTCTGATTATGATCTTCTCATCCCTCCATTAGGT 

CCTTTAAAAAAGTTTTGAACAACCCACCCCTCCTTC^ 

GTATAACACAGACTCACTGGGTTGTGAATTTGCCTGAAATTTGAATGGGTTCTCCAGGTGCCGGTGACTC 
CC^GTTCACGAGACCATTACTCCATGTAGATGATTAAGGTAGTAGTGTAGTAGTTGGGCATCAGTCAGG 
TTTTAAGCAAGTTGTTTTGTCCATACTAAATGTA 

TTTGAAGTGATGACTTGAAGTGTTGAGATTTTCTTTAAGTATAATAATTCTTAATAAATATGAACTTGCT 
TTTCTTGCAGCATGAGCACCAGTTCCACTTACGCTAATTAAATTATGCAAAATTAAATAGTTGTATGTAG 
AGAACTGATAATAAATTCTGTTTTATTCTAATCATTACAACTGTAACACATTCAAAAAAAAAAA 

Human CBL-B mRNA sequence - var2 (public gi: 2327390s) (SEQ ID NO: 354) 

AGCGGAGTGCTGCTGCGGCGTCCCX3CGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTG 
TCTTGACGGTAATTGTTGCGTTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCG 
AGCTGTTCTCAGCG^TCCCACTCCCAGCCGGGGCTCCCC^ 
GACCCGCGC^CACGCGTGTCTCTGGACAGCTACGGCGCCGAAAGA 

ATGAATGGCAGAAACCCTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATG 

CTATTCAGGATGCAGTTGGACCCCCTAAGCAAGCTGCCGCAGATCGCAGGACCGTGGAGAAGACTTGGAA 

GCTCATGGACAAAGTGGTAAGACTGTGCCAAAATCCCAAACTTCAGTTGAAAAATAGCCCACC^ 

CTTGATATTTTGCCTGATACATATCAGCATTTACGACTTATATTGAGTAAATATGATGACAACCAGAAAC 

TTGCCCAACTCAGTGAGAATGAGTACTTTAAAATCTACATTGATAGCCTTATGAAAAAGTCAAAACGGGC 

AATAAGACTCTTTAAAGAAGGCAAGGAGAGAATGTATGAAGAACAGTCACAGGACAGACGAAATCTCACA 
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ftAACTCTCCCTTATCTTCAGTCACAT 

GAGATAACTTTCGTATCACAAAAGCAGATGCTGCTC 

X^TACCATGGAAAGTATTCAGACAGTGCCT^ 

GCTCTAAAATCAACAATTGATTTAACTTGCAATGATTACATT^ 

GGCTGTTTCAGCCTTGGGGCTCTATTTTGCGGAACT 

GGCATTTCTCACATATGATGAAGTTAAAGCACX^CTACAGAA 

TTCCGGTTAAGTTGCACTOTATTGGGACAGT^ 

AGACCATACCTCATAACAAGCCCTTATTTC^ 

TCCTGATGGGAGGAGTTATAATCCTGATTTAACTGGATTA^ 

GTTACACAGGAACAATATGAATTATATTGTGAAATGGGCTCCACT 

AGAATGACAAAGATGTCAAGATTGAGCCT^ 

GGAGTCX3GATGGTCAGGGCTGCCCTTTCTGTCGTTGTGAAATAAAAGGAAOT 
CCCTTTGATCCAAGAGATGAAGGCTCCAGGTGTTGCAGCATCATT<^ 
ACTTGGACGACGATGATGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCAAA 
TGACAGGCAGAACTCACCAGTCACATCACCAGGATCCTCTCC C 

GACCCACTCCAGATCCCACATCTAAGCCTGCCACCCGTGCCTCCTCGCCTGGATCTAATTCAGAAAGGCA 

TAGTTAGATCTCCCTGTGGCAGCCCAACGGGTTCAC 

TAAACC^CTCCCAGCACC^CCTCCTCCC 

ATCCCACCAGACAATAGACTGAGTAGACACATCCATCATGTGGAAAGCGTGCCTTCCAAAGACC 
TGCCTCTTGAAGCATGGTGCCCTCGGGATGTGTTTGGGACTAATCAGCTTGTGGGATGTCGACT 
<3GAGGGCTCTCCAAAACCTGGAATCApiGCGAGTTCAAAT^ 

GACCCAGTGCTTATGCGGAAACACAGACGCCATGATTTGCCTTTAGAAGGAGCTAA 

GTCACCTTGGAAGTGAAGAATATGATGTTCCTCCCCGGCTTTCTCCTCCTCCTCCAGTTACCACCCTCCT 
CCCTAGCATAAAGTGTACTGGTCCGTTAGCA^TTCTCTTTCAGAGAAAACAAGAGACCCAGTAGAGGAA 
GATGATGATGAATACAAGATTCCTTCATCCCACCCTC 

ATGTAAAACCTCCTGTTCGGTCTTGTGATAATGGTCACTGTATGCTGAATGGAACACATGGTCCATCTTC 
AGAGAAGAAATCAAACAT CCCTGACTTAAGCAT^ CAGCCT CTGAT 

CCCGTGCCATTACGACCTGCCAGGCCTCCAACTCGGGACAATC 

CGCCCTCTGATTATGATCTTCTCATCCCTCCATTAGGTGAAGATGCTTTTGATGCCCTCCCTC 
CCCACCTCCCCCACCTCCTGCAAGGCATAGTCTCAT^ 
CCATCCTCAGGACAGGATCTTTl^CTTCTTCCTTCAGATCCCTTTGTTGA 
CTTTGCCTCCCGCTAGAAGGTTACCAGGTGAAAATGT 

GCTTCCTTCATGTTCAGATGGTTCACAGGCACCAGCCAGACCCCCTAAACCACGACCGCGCAG^ 

CC^GAAATTCACC^CAGAAAACCCCATGGGCCTGAGGCGGCATTGGAAAATGTCGATGCAA 

AACTCATGGGAGAGGGTTATGCCTTTGAAGAGGTGAAGAGAGCCTTAGAGATAGCCCAGAATAATGTCGA 

AGTTGCCCGGAGC^TCCTCCGAGAATTTGCCTTCCCTCCTCCAGTATCCCCACGTCTAAATCTATAGCAG 

CCAGAACTGTAGACACCT^AAATGGAAAGCAATCGATGTATTCCAAGAGTGTGGAAATAAAG 

ATGGAATTCAAGAGAGAAGTGTCTCCTCCTCGTGTAGCAGCTTG 

GAAAGGAGACCGATGCTTGCTCAGGATGTCGACAGCTGT^ 

AATCAGGGTTGAACTGACAAAAATAATTTAAAGACGTTTACTTCCCTTGAACTTTGAACCTGTGAAAT 

TTTACCITGTTTACAATTTGGCAAAGTTGCAGTT.TGTTCTTGTTTTTAGTTTAGTTTTG 

TGATACCTGTACTGTGTTCTTCACAGACCCTTTGTAGCGTGG 

CTCTCTTGCTGTCCACATCAACAGCTAAATCATTTAT^ 

CAGGTCCAGTTCCATTTCTCTC^TTCACAAGATGCITTC 

GCAAAAAAAAAAGTATGTATTCTTCACTACTGAGTTTCTTC^^ 

AAAAACCTGAATGTATAAAGGATTTATTTGTCAATAAACT 

AAAAAAAA 

Human CBL-B raRNA sequence - var3 (public gi: 8624 06) (SEQ ID NO: 355) 

CTGGGTCCTGTGTGTGCCACAGGGGTGGGGTGTCCAGCGAGCGGTCTCCTCCTCCTGCTAGTGCTGCTGC 
GGCGTCCCGCGGCCTCGCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCTTGACGGTAATTGT 
TGCGTTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCGAGCTGTTCTCAGCGAT 
CCCACTCCCAGCCGGGGCTCCCCACACACACTGGGCTGOT^ 

TGTCTCTGGACAGCTACGGCGCCGAAAGAACTAAAATTCCAGATGGCAAACTCAATGAATGGCAGAAACC 

CTGGTGGTCGAGGAGGAAATCCCCGAA^GGTCGAATTTTGGGTATTATTGATGCTATTCAGGATGCAGT 

TGGACCCCCTAAGCAAGCTGCCGCAGATCGC^GGACCGTGGAGAAGACTTGGAAGCTCATGGACAAAGTG 

GTAAGACTGTGCCAAAATCCCAAACTTCAGTTGAAAAATAG^ 

ATACATATCAGCATTTACGACTTATATTGAGTAAATAT^ 

GAATGAGTACTTTAAAATCTAC^TTGATAGCCTTATGAAA^^ 

GAAGGCAAGGAGAGAATGTATGAAGAACAGTCACAGGACAGACGAAATCTCAGAAAACTG 

TCAGTCACATGCTGGGAGAAATCAAAGCAATCTTTCCCAATGGTCAATO 

CT^CAAAAGCAGATGCTGCTGAATTCTGGAGAAAGTTTTTTGGAGACAAAACTATCG 

TTCAGACAGTGCCTTCATGAGGTCCACCAGATTAGCTCTAGCCTGGAAGCAATGGCTCTAAAATCAACAA 
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TTGATTTAACTTGCAATGATTAGATTTCM 

GGGCTCTAXTTTGCGGAATTGGAATTTCTTAGC^ 

GATGAAGTTAAAGCACGACTACAGAAATATAGCACCJ^ 

ctcx^ttgggacagtgggccattggctatgtgactggggatgggaatatct 
caagcccttatttcaagccctgattgatggcagc^ 
tataatcctgatttaactggattatgtgaacctacacctcatga 
atgaattatattgtgaaatgggctccacttttcagctctgtaagatttgtgca 

CAAGATTGAGCCTTGTGGGCATTTGATGTGCAC^ 

GGCTGCCCITT(nH3TCGTTGTGAAATAAAAGGAACTGAGCCC^ 

ATGAAGGCTCCAGGTGTTGCAGCATCATTGACCCCTTT 

TGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCAAACGTCCGAAAGTC 

CCAGTCACATCACGAGGATCCTCTCCCCTTGCCCAGAGAAGAAA 

CACATCTAAGCCTGCCACCCGTGCCTCCTCGCCTGG 

TGG CAGCCCAACAGGTTCACGAAAGTCTTCTCC^^ 

CCACCTCCTCCCTTAAGAGATCCTCCTCCAC 

GACTGAGTAGACACATCCATGATGTGGAAAGCGTGCCTTCCAGAGACCC 

GTGCCCTCGGGATGTGTTTGGGACTAATCAGCTTGTGGGATGTCGACTCCTAGGGGAGGGCTCTCCAAAA 
CCTGGAATCACAGCGAGTTCAAATGTCAATGGAAGGCAG 

GGAAACACAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGGTCTTTTCCAATGGTC^ 
AGAATATGATGTTCCTCCCCGGCTTTCTCCTCCT 

ACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAAC^GAGACCCAGTAGAGGAAGATGATGATGAATACA 

AGATTCCTTCATCCCACCCTGTTTCCCTG^T^ 

TCX3GTCCTGTGATAATGGTCACTGTATGCTGAATG 

ATCCCTGACTTAAGCATATATTTAAAGGGAGATGTTTTTGATTCAGCCTCTGA 
CTGCGAGGCCTCCAACTCGGGACAATCCAAAGCAT 

TCTTCTCATCCCTCCATTAGGTGAAGATGCTTTTGATGCCCTCCCTCCATCTCTCCCACCTCCCCCACCT 

CCTGCAAGGCATAGTCTGATTGAACATTCAAAACCTCCTGGCTCCAGTAGCCGGCGATCCT 

ATCTTTTTCTTCTTCCTTCAGATCCCTTTGTTGATCTAGCAAGTGGCCAAGTTCCTTTG 

AAGGTTACCAGGTGAAAATGTCAAAACTAACAGAAC^TCACAGGACTATGATCAGCTTCCTTC^TGTTCA 

GATGGTTC^CAGGCACCAGCCAGACCCCCTAAACCACGAC^ 

GAAAACCCCATGGGCCTGAGGCGGCATTGGAAAATGTCGATGCAAAAATTGCAAAACTCATGGGAGAGGG 
TTATGCCTTTGAAGAGGTGAAGAGAGCCTTAGAGATAGCCCAGAATAATGTCGAAGTTGCCCGGAGCATC 
CTCCGAGAATTTGCCTTCCCTCCTCCAGTATCCCCACGTCTAAATCTATAGCAGCCAGAACTGTAGACAC 
CAAAATGGAAAGGAATCGATGTATTCCAAGAGTGTGGAAATAAA^ 

AAGTGTCTCCTCCTCGTGTAGCAGCTTGAGAAGAGGCTTGGGAGTGCAGCTTCTCAAAGGAGAC 

TTGCTCAGGATGTCGAC^GCTGTGGCTTCCTTGTTTTTGCTAGCCATATTTTTAAATCAGGGTTGAACTG 

ACAAAAATAATTTAAAGACGTTTACTTCCCT 

TTTGG(^AAGTTGCAGTTTGTTCOTGTTTTTAGTTTAGTTTTGTTTTGGTGTT 

ttcttcacagaccctttgtagcgtggtcaggtctgctgtaacatttcccaccaactctcttgctgtccac 

atcaacagctaaatcatttattc^tatggatct^ 

tctctcattcacaagatgctttgaaggttctgatc^ 

tgtattcttcactactgagtttcttcttc^ 

taaagcatttatttgtcaataaactgcct 

Human. CBL-B mRNA sequence - var4 (public gi: 862408) (SEQ ID NO: 356) 

CTGGGTCCTGTGTGTGCCACAGGGGTGGGGTGTCCAGCGAGCGGTCTCCTCCTCCTGOTAGTGCTGCTGC 
GGCGTCCCGCGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCTTGACGGTAATTGT 
TGCX3TTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCGAGCTGTTCTCAGCGAT 
CCC^CTCCCAGCCGGGGCTCCCCACACACACTGGGCT^ 

TGTCTCTGGACAGCTACGGCGCCGAAAGAACTAAAATTCCAGATGGCAAACTCAATGAAT^ 

CTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATGCTATTCAGGATGCAGT 

TGGACCCCCTAAGCAAGCTGCCGCAGATCGCAGGACCGTGGAGAAGACTTGGAAGCTCATGGACAAAGTG 

gtaagactgtgccaaaatcccaaacttcagttgaaaaatagcc 

atacatatcagctvtttacgacttatattgagtaaat^ 

gaatgagtactttaaaatctacattgatagcctta^ 

gaaggcaaggagagaatgtatgaagaacagtcacaggacagacgaaatctcacaaaactgtcccttat 
tcagtga£^tgctggcagaaatcaaagcaatctttc 

CACAAAAGCAGATGCTGCTGAATTCTGGAGAAAGTTTTTTGGAGACAAAACTAT 
TTCAGACAGTGCCTTCATGAGGTCCACCAGATTAGCTCT^ 
TTGATTTAACTTGCAATGATTACATTT(^ 
GGGCTCTATTTTGCGGAATTGGAATTTCTTAGCTGTGAC^ 

GATGAAGTTAAAGCACGACTACAGAAATATAGCACCAAACCCGGAAGCTATATTTTCCGGTTAAGTTGCA 
CTCGATTGGGACAGTGGGCCATTGGCTATGTGACTGGGGATGGGAATATCTTACAGACCATACCTCATAA 
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CAAGCCCTTATTTCAAGCCCTGATTGAT^ 

TATAATCCTGATTTAACTGGATTATGTGAACCTA<^ 

ATGAATTATATTGTGAAATGGGCTCCACTTTTCAGCTCTC 

CAAGATTGAGCCTTGTGGGCATTTGATGTGCACCTCTT^ 

GGCTGCCCTTTCTGTCGTTGTGAAATAAAAGGAACTGAGCCCAT^ 

ATGAAGGCTCCAGGTGTTGCAGCATC^TTGACCCCTT^ 

TGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCAAACXaTCCGAAAG 

CCAGTCACATCACCAGGATCCTCTCCCCTTGCCCAGAGAA 

CACATCTAAGCCTGCCACCOSTGCCT^ 

TGGCAGCCCAACAGGTTCACCAAAGTCTTCTCCTTGCATC 

CCACCTCCTCCCTTAAGAGATCCTCCTCCAC^ 

GACTGAGTAGAGAXIATCCATCATGTGGAAAGCGTGCC 

GTGCCCTCGGGATGTGTTTCGGACTAATCAGCTTGTGGGATGTCGACTCCTAGGGG 

CCTGGAATGACAGCGAGTTCAAATGTCAATGGAAGGGA£A^ 

GGAAAGACAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGGTCTTTTCCAA 

AGAATATGATGTTCCTCCCCXKSCTTTCrCCTCCTCCTCCAGTTACCACCCTCCTCCCTAGC^ 

ACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAACAAGAGACCCAGTAG^ 

AGATTCCTTCATCCGACCCTGTTTCCCTGAAT^ 

TCGGTCCTGTGATAATGGTCACTGTATGCTGAATGG 

ATCCCTGACTTAAGCATATATTTAAAGGGAGATGTTTTTGATTCAGCCTCTGATCCCG 

CTGCCAGGCCTCCAACTCGGGACAATCCAAAGCATGGTTCTTCA.CT 

TOTTCTCATCCCTCCATTAGGTTGAAACCTTTAAAAAAGTTTTGAACAACCCACCCCT 

TTCAGAATTTTCAGAATTCAGAGTT CAGTATAACACAGACT CACTGGGTTGTGAATTTGCCTGAAATTTG 

AATGGGTTCTCCAGGTGCCGGTGACTCCCAAGTTCACGAGACCATTACTCCATGTAGATGAT^ 

TAGTGTAGTAGTTGGGCATCAGTCAGGTTTTAAGCAAGT^ 

CACATGAGAGCTTTGTGCTCTAGTAGTTTTGAA 

ATAATTCTTAATAAATATGAACTTGCTTTTCTTGCAGCATGAGGACGAGTTCCACT 

TATGCAAAATTAAATAGTTGTATGTAGAGAACTGATAATAAATTCTGTTTTATTOTAATCATTACAACTG 
T AACACATT CAAAAAAAAAAA 

Human CBL-B mRNA sequence - var5 (public gi: 862410) (SEQ ID NO: 357) 

CTGGGTCCTGTGTGTGCCACAGGGGTGGGGTGTCCAGCGAGCGGTCTCCTCCTCCTGCTAGTGCTGCTGC 
GGCGTCCCGCGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCTTGACGGTAATTGT 
TGCGTTTCCACGTCTCGGAGGCCTGCGCGCTGGGTTGCTCCTTCTTCGGGAGCGAGCTGTTCTCAGCGAT 
CCCACTCCCAGCCGGGGCTCCCCACACACACTGGGCT 

TGTCTCTGGACAGCTACGGCGCCGAAAGAACTAAAATTCCAGATGGCAAACTCAATGAATGGCAGAAACC 
CTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATGCTATTCAGGATGCAGT 
TGGACCCCCTAAGCAAGCTGCCGCAGATCGCAGGACCGTGGAGAAGACTTGGAAGCTCATGGACAAAGTG 
GTAAGACTGTGCCAAAATCCCAAACTTCAGTTGAAAAATAGCCC^ 

ATACATATCAGCATTTACGACTTATATTGAGTAAATATGATGACAACCAGAAACTTGCCCAACTCAGTGA 
GAATGAGTACTTTAAAATCTACATTGATAGCCTTATGAAAAAGTCAAAACGGGCAATAAGACTCTTTAAA 
GAAGGCAAGGAGAGAATGTATGAAGAACAGTGACAGGACAGACGAAATCTCAGAAAACTGTCCCTTATCT 
TC^GTCAC^TGCTGGCAGAAATCAAAGCAATCTT^ 

CACAAAAGCAGATGCTGCTGAATTCTGGAGAAAGTTTTTTGGAGACAAAACTATCGTACCATGGAAAGTA 
TTCAGACAGTGCCTTCATGAGGTCCACCAGATTAGCTCTAGCCTGGAAGCAATGG 

TTGATTTAACTTGCAATGATTACATTTCAGTTTTTGAATTTGATATTTTTACCAGGCTGTTTCAGCCTTG 
GGGCTCTATTTTGCGGAATTGGAATTTCTTAGCTGTGACACATCCAGGTTACATGGCATTTCTCACATAT 
GATGAAGTTAAAGCACGACTACAGAAATATAGCACCAAACCCGGAAGCTATATTTT 

CTCGATTGGGACAGTGGGCCATTGGCTATGTGACTGGGGATGGGAATATCTTACAGACCATACCTCATAA 

CAAGCCCTTATTTCAAGCCCTGATTGATGGCAGCAGGGAAGGATTTTATCTTTATCCTGATGGGAGGAGT 

TATAATCCTGATTTAACTGGATTATGTGAACCTACACCTCATGACCATATAAAAGTTACy^CAGGAACAAT 

ATGAATTATATTGTGAAATGGGCTCCACTTTT 

CAAGATTGAGCCTTGTGGGCATTTGATGTGCACCTCTTGCCT 

GGCTGCCOTTTCTGTCGTTGTGAAATAAAAGGAACTGAGCCCATAATCGTGGACCCCTTTGATCCAAGAG 

ATGAAGGCTCCAGGTGTTGCAGCATCATTGACCCCTTTGGrCATGCCGATGCTAGACTTGGACGACGATGA 

TGATCGTGAGGAGTCCTTGATGATGAATCGGTTGGCA^CGTCCGAAAGTGCACTGACAGGCAGAACTCA 

CCAGTCACATCACCAGGATCCTCTCCCCTTGCC(^GAGAAGAAAGCCA(^GCCTGACCCACTCCAGATCC 

CAGATCTAAGCCTGC(^CCCGTGCCTCCTCGCC 

TGGCAGCCCAACAGGTTCACCAAAGTCTTCTCCTTGCATGG 

CCACCTCCTCCCTTAAGAGATCCTCCrCC^ 

GACTGAGTAGACAGATC(^TCATGTGGAAAGCGTGCCTTCCAGAG 

GTGCCCTCGGGATGTGTTTGGGACTAATCAGCTTGTGGGATGTCGACTCCTAGGGGAGGGCTCTCCAAAA 
CCTGGAATCACAGCGAGTTCAAATGTGAATGGAAGGCACAGTAGAGTGGGCTCTGACCCAGTGCTTATC 
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GGAAACACAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGGTCCT 
AGAATATGATGTTCCTCCCCGGCTTTCTCCT 

ACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAACAAGAGACCCAGTAGAGG 
AGATTCCTTGATCCCACCCTGTTTCCCTG 

TCGGTCCTGTGATAATGGTCACTGTATGCTCAATGGAACACATGGTCC^ 

ATCCCTGACTTAAGCATATATTTAAAGGGTACGTATAGAATATAATTTCCTTTGTGATGTACATCTTAA 

GGTCAGAATTTAAAGGCAAAATTTCATGCCATTGTACTGAAAATACA 

GGAGATGTTTTTGATTCAGCCTCTGATCCCGTC 

CAAAGCATGGTTCTTCACTCAACAGGACGCCCTCT^ 

CCTTTAAAAAAGTTTTGAACAACCCACCCCTCCTTCTTTTAATOT 

GTATAACACAGACTCACrGGGTTGTGAATTTGCCTGAAATTTGAATO 

CCAAGTTCACGAGACCATTACTCCATGTAGATGATTAAGGTAGTAGTGTAGTAGTTGGGCATCAGT 
TTTTAAGCAAGTTGTTTTGTCCATACTAAATGTAGTCT 
TTTGAAGTGATGACTTGAAGTGTTGAGATTTTCTTTAAGTATAATAAT^ 
TTTCTTGCAGCATGAGCACCAGTTCCACTTACGCT 

AGAACTGATAATAAATTCTGTTTTATTCTAATCATTACAACTGTAACACATT 

Human CBL-B noRNA sequence - var6 (public gi: 21753192) (SEQ ID NO: 358) 

AGTGCTGCTGCGGCGTCCCGCGGCCTCCCCGAGTCGGGCGGGAGGGGAGAGCGGGTGTGGATTTGTCT'TG 
ACGGTAATTGTTGCGTTTCC^CGTCTCGGAGGCCTGCGCGCTGGGTTGCT 
TTCTCAGCGATCCCACrrCCCAGCCGGGGCTCCCCACACACACTGGGCTC 
GCGCACACGCGTGTCTCTGGACAGCTACGGCGCCXSAAAGAACTAAAATTCCl^ 

TGGCAGAAACCCTGGTGGTCGAGGAGGAAATCCCCGAAAAGGTCGAATTTTGGGTATTATTGATGCTATO 

CAGGATGCAGTTGGACCCCCTAAGCAAGCTGCCGGAGA 

GTCAATGGAAGGC^CAGTAGAGTGGGCTCTGACCCAGTGCT 

CTTTAGAAGGAGCTAAGGTCTTTTCCAATGGTCACCTTGGAAGTGAAGAATATGATGTTCCTCCCCGGOT 

TTCTCCTCCTCCTCGAGTTACCACCCTCCTCCCTAGCATAAAGTGTACTGGTCCGTTAGCAA 

TCAGAGAAAAGAAGAGACCCAGTAGAGGAAGATGATGATGAAT^ 

CCCTGAATTCACAACCATCTC^TTGTCATAATGTAAAACCTCCTGTTCGGTCTTGTGATAATGGTCACTG 
TATGCTGAATGGAACACATGGTCCATCTTGAGAGA 

AAGGGAGATGTTTTTGATTCAGCCTCTGATCCCGTGCCATTACGACCTGCGAGGCCT 
ATCGAAAGCATGGTTCTTCACTCAACAGGACGCCCT 
AGATGCTTTTGATGCCCTCCCTCCATCTCTCCC^^ 
CATTCAAAACCTCCTGGCTCCAGTAGCCGGCCATCCTCAGC^ 

CCTTTGTTGATCTAGCAAGTGGCCAAGTTCCTTTGCCTCCTGCTAGAAGGTTACCAGGTGAAAATGTCAA 

AACTAAGAGAACATCAGAGGACTATGATCAGCTTC^ 

CCCCCTAAACGACGACCGCGCAGGACTGCACCAGAAAT^ 

CATTGGAAAATGT CGATGCAAAAATTG CAAAACTCATGGGAG AGGGTTATGCCTTTGAAGAGGTGAAGAG 
AGCCTTAGAGATAGCCCAGAATAATGTCGAAGTTGCCCGGAGCATCCTCCGAGAATTTGCCTTCCCTCCT 
CC^GTATCCCCACGTCTAAATCTATAGCAGCCAGAACTGTAGACACCAAAATGGAAAGCAATCGATGTAT 
TCG^GAGTGTGGAAATAAAGAGAACTGAGATGGAATTCAAGAGAGAAGTC 

CTTGAGAAGAGGCTTGGGAGTGCAGCTTCTCAAAGGAGACCGATGCTTGCTCAGGATGTCGACAGCTGTG 
GCTTCCTTGTTTTTGCTAGCCATATTTTTAAATCAGGGTTGAACT 

CTTCCCTTGAACTTTGAACCTGTGAAATGCTTTACCTTGTTTACAGTTTGGCAAAGTTGCAGTTTGTTCT 

TGTTTTTAGTTTAGTTTTGTTTTGGTGTTTTGTACCTGTACTGTGTTCTTCACAGACCCTTTGTAGCGTG 

GTGAGGTCTGCTGTAACATTTCCGACCAACTCTCT 

ATGGATCTCTACCATCCCCATGCCTTGCC^ 

GGTTCTGATTTTCAACTGATCAAACTAATGCAAAAAAAAAAAAAAAAAAA 

Human Cbl-b mRNA sequence - var 7 (SEQ ID NO: 359) 

CGTNTTTGGNANNCACTACAGGGGATGTTTAATACACACT 

G 

TAAGATACCCCACTCAAACCCATAAAAAAGAGCM 
A 

CCCATACGACGTACCAGATTACGCTCATATGGCCATGGAG^ 
T 

GGACTCTGACCCANTGCTTATGCGGAAAC^CAGACGCCATGATTTGCCTTTAGAAGGAGCTAAGG 
C 

ACCTTGGAAGTGAAGAATATGATGTTCCTCCCCGGCnTTCTCCT 
G 

TGTACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAACAAGA^ 
C 
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TTCATCCCACCCTGTTTCCCTGAATT 
<3 

GTCACTGTATGCTGAATGGAACACATGGTCCATCTT<^ 
G 

GGTGAAGATGCTTTTGATGCCCTCCCTCCM^ 
C 

AAAACCTCCTGGCTCCAGTAGCCGGCCT^ 
G 

CAAGTGGCCAAGTTCCTTTGCOTCCCGCTAGAAGGTTACCAGGTGAAAA 
T 

GATGAGCTTCCTTCATGTTCAGATGGTTCACAGGCACCAGCGAGACCCCCT 
A 

AATTCACCACAGAAAACCCCATGGGCCTGAGGCGGCATTGGAAAATGT 
G 

GTTATGCCTTTGAAGAGGTGAAGAGAGCCTTAGAGATAGCCCAGAATAATGTCGAAGTTGCCCGG 
A 

TTTGCCTTCCCTCCTCCAGTATCCCCACGTCTAAATCTATAGCAGCCAGAAOTGTA 
A 

TGTATTCCAAGAGTGTGGAAATAAAGAGAACTGAGATGGAATTCAAGAGAGAAGTGTCTCOT 
A 

GAAGAGGCTTGGGAGTGCAGCTTCTCAAAGL&AAACCGATGCTTC 
T 

GCTAGCCATTTTTTTAAATNAGGGTTGAACTNGANAAAANTATTTAAAAACGTTTACCTCCCTTG 
■G 

AAAGNC 

-Human Cbl-b Protein sequence - var 7 (SEQ ID NO: 361) 

MRKHRRHDLPLEGAKVSSNGHIiGSEEYDVPPR^ 

S 

LNSQPSHCHNVKPPVRSCDNGHCMIjNGTHGPSSEKKSNI PDLSIYIiKGEDAFDALPPSIjPPPPPPARHSIjIEHSKPPGS 

s 

SRPSSGQDLFIjLPSDPFVDLASGQVPLPPAIUILPGENVCTNRTSQDYDQLPSCSDGSQAPAR^ 

p 

HGPE AAIiENVDAKIAKLMGEGYAFEEVKRALE I AQNNVE VARS I LRE FAFP PPVS PRLNXi 

Human cbl-B clone3Gdll4 (partial sequence) (SEQ ID NO: 360) 

ACTCTGACCCAGTGCTTATGCGGAAACACAGACGCCATGATTTGCCTTTA 

GAAGGAGCTAAGGTCTCTTCCAATGGTCACCTTGGAAGTGAAGAATATGA 

TGTTCCTCCCCGGCTTTCTCCTCCTCCTCCAGTTACCACCCTCCTCCCTA 

GCATAAAGTGTACTGGTCCGTTAGCAAATTCTCTTTCAGAGAAAACAAGA 

GACCCAGTAGAGGAAGATGATGATGAATACAAGATTCCTTCATCCCACCC 

TGTTTCCCTGAATTCACAACCATCTCATTGTCATAATGTAAAACCTCCTG 

TTCGGTCTTGTGATAATGGTCACTGTATGCTGAATGGAACACATGGTCCA 

TCTTCAGAGAAGAAATCAAACATCCCTGACTTAAGCATATATTTAAAGGG 

TGAAGATGCTTTTGATGCCCTCCCTCCATCTCTCCCACCTCCCCCACCTC 

CTGCAAGGCATAGTCTCATTGAACATTCAAAACCTCCTGGCTCCAGTAGC 

CGGCCATCCTCAGGACAGGATCTTTTTCTTCTTCCTTCAGATCCCTTTGT 

TGATCTAGCAAGTGGCCAAGTTCCTTTGCCTCCCGCTAGAAGGTTACCAG 

GTGA2\AATGTCAAAACTAACAGGACATCACAGGACTATGATCAGCTTCCT 

TCATGTTCAGATGGTTCACAGGCACCAGCCAGACCCCCTAAACCACGACC ■ 

GCGCAGGACTGCACCAGAAATTCACCACAGAAAACCCCATGGGCCTGAGG 

CGGCATTGGAAAATGTCGATGCAAAAATTGCAAAACTCATGGGAGAGGGT 

TATGCCTTTGAAGAGGTGAAGAGAGCCTTAGAGATAGCCCAGAATAATGT 

CGAAGTTGCCCGGAGCATCCTCCGAGAATTTGCCTTCCCTCCTCCAGTAT 

CCCCACGTCTAAATCTATAGCAGCCAGAACTGTAGACACCAAAATGGAAA 

GCAAlfcGATGTATTCCAAGAGTGTGGAAATAAAGAGAACTGAGATGGAAT 

TCAAGAGAGAAGTGTCTCCTCCTCGTGTAGCAGCTTGAGAAGAGGCTTGG 

GAGTGCAGCTTCTCAAAGAAAACCGATGCTTGCTCAGGATGTCGACAGCT 

GTGGCTTCCTTGTTTTTGCTAGCGATTTTTTTAAATCAGGGTTGAACTGG 

AAAAAATTATTTAAAAACGTTTACCTCCCTTGAACTTTGAACCTGGGAAA 
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GGC 

Human CblB protein in 3Gdll4 Translation of cbl-B clone3Gdll4 starting 
at base pair 3 (SEQ ID NO: 398) 

SDPVLMRKHRRHDLPLEGAKVSSNGHLGSEEYDVPPRLSPPPPVTTLIjPS 

I KCTGPLANSLSEKTRD PVEEDDDEYKI PS SHPVSLNSQPSHC^^ 

RS CDNGHCMLNGTHGPS SEKKSNI PDLS I YLKGEDAFDALPPSLPPPPPP 

ARHSLIEHSKPPGSSSRPSSGQDLFLLPSDPFVDIASGQVPLPPARRLPG 

ENVKTNRTSQDYDQLPSCSIX3SQAPARPPKPRPRRTAPEIHHRKPHGPEA 



Human CBL-B Protein sequence - varl (public gi: 4757920) (SEQ ID NO: 227) 

MANSMNGRNPGGRGGNPRKGRI LGI IDAIQDAVGPPKQAAADRRTVEKTWKLMDKVVRLCQNPKLQLKNS 

PPYILDILPDTYQHLRLILSKYDDNQKLAQLSENEYFKIYIDSLMKK^ 

RNLTKLSLIFSHMLAEIKMFPNGQFQGDNFRITKADAAE 

LEAMALKSTIDLTCNDYI SVFEFDI FTRLFQP WGS I LRNWNFLAVTHPGYMAFLTYDEVKARLQKYSTKP 

GSYI FRLSCTRLGQWAIGYVTGDGNILQTI PHNKPLFQALIDGSREGFYLYPDGRS YNPDLTGLCEPTPH 

DHIKVTQEQYELYCEMGSTFQLCKICAENDKDVKIEPCGHLMCTS^ 

I IVDPFDPRDEGSRCCSI IDPFGMPMLDLDDDDDREESLMMNRLANVRKCTDRQN^^ 

KPQPDPLQIPHLSLPPVPPRLDLIQKGIVRSPOSSPTGSPKSSPCMVRra^ 

RPPPIPPDNRLSRHIHHVESVPSRDPPMPLEAWCPRDVFGTNQLVGCRLI^EGSPKPGITASSNVNG^ 
RVGSDPVLMRKHRRHDLPLEGAKVFSNGHLGSEEYDVPPRLSPPPPVTTLLPS I KCTGPLANSLSEKTRD 
PVEEDDDEYKI PS SHPVSLNSQPSHCHNVKPPVRSCDNGHCMLNGTHGPS SEKKSNI PDLS I YLKGTYRI 

Human CBL-B Protein sequence - var2 (public gi: 23273909) (SEQ ID NO: 228) 

MANSMNGRNPGGRGGNPRKGRI LGI IDAIQDAVGPPKQAAADRRTVEKTWKLMDKVVRLCQNPKLQL^ 
PPYILDILPDTYQHIiRLIIjSKYDDNQKIAQLSENEYFK^ 

RNLTKLSLI FSHMLAE I KAI FPNGQFQGDNFRI TKADAAEFWRKFFGDKTIVPWKVFRQCLHEVHQI SSG 

LEAMALKSTIDLTCNDYISWEFDIFTRLFQPWGSILRNWNFLAVra^ 

GSYIFRLSCTRIiGQWAIGYVTGDGNILQTIPHNKPLFQALIDGSREGFYLYPTC 

DHIKVTQEQYELYCEMGSTFQLCKICAENDKDVKIEPCGHLMCTSCL^ 

I IVDPFDPRDEGSRCCSI I DP FGMPMLDLDDDDDREESLMMNRLANVRKCT^ 

KPQPDPLQIPHLSLPPVPPRLDLIQKGIVRSPCGSPTGSPKSSPCMVRKQDKPLPAPPPPLRDPPPPPPE 
RPPPIPPDNRLSRHIHHVESVPSKDPPMPLEAWCPRDVFGTNQLVGCRLLGEGSPKPGITASSNVNGRHS 
RVGSDPVLMRKHRRHDLPLEGAKVFSNGHLGSEEYDVPPRLSPPPPVTTLLPS 

PVEEDDDEYKIPSSHPVSLNSQPSHCHNVKPPVRSCDNGHCMLNGTHGPSSEKKSNI PDLS I YLKGDVFD 
SASDPVPLPPARPPTRDNPKHGSSLNRTPSDYDLLIPPLGEDAFDALPPSLPPPPPPARHSLIEHSKPPG 
SSSRPSSGQDLFLLPSDPFVDLASGQVPLPPARRLPGENVKTNRTSQDYDQLPSCSDGSQAPARPPKPRP 
RRTAPE IHHRKPHGPEAALENVI5AKI AKLMGEGYAFEEVKRALE I AQNNVEVARS I LRE FAFPPPVS PRL 



Human CBL-B Protem sequence - var3 (public gi: 862407) (SEQ ID NO: 229) 

MANSMNGRNPGGRGGNPRKGRI LGI I DAI QDAVGPPKQAAADRRTVEKTWKLMDKVVRLCQNPKLQLKNS 
PPYILDILPDTYQHLRLILSKYDDNQKLAQLSENEYFKIYIDSLMK3CSKRAIRLFKEG 
RNLTKLSLIFSHMLAEIKAIFPNGQFQGDNFRITKADAAEFWRKFFGDKTIVPWKVFRQC^H^ 
LEAMALKSTIDLTCNDYI SVFEFDI FTRLFQPWGS ILRNWNFLAVTHPGYMAFLTYDEVKARLQKYSTKP 
GSYI FRLSCTRLGQWAIGYVTGDGNILQTI PHNKPLFQALIDGSREGFYLYPDGRS YNPDLTGLCEPTPH 
DHI KVTQEQYELYCEMGSTFQLCKI CAENDKDVKI E PCGHLMCTS CLTAWQESDGQGCPFCRCE I KGTE P 
I IVDPFDPRDEGSRCCS I IDPFGMPMLDLDDDDDREESLMMNRLANVRKCTDRQNSPVTSPGSSPLAQRR 
KPQPDPLQIPHLSLPPVPPRLDLIQKGIVRSPCGSPTGSPKSSPCMVRKQDKPLPAPPPPLRDPPPPPPE 
RPPPI PPDNRLSRHIHHVESVPSRDPPMPLEAWCPRDVFGTNQLVGCRLLGEGS PKPGI TASSNVNGRHS 
RVGSDPVLMRKHRRHDLPLEGAKVFSNGHLGSEEYDTO^ 

PVEEDDDEYKI PSSHPVSLNSQPSHCHNVKPPVRS CDNGHCMLNGTHGP S SEKKSNI PDLS I YLKGDVFD 
SASDPVPLPPARPPTRDNPKHGSSLNRTPSDYDLLIPPLGEDAFDALPPSLPPPPPPARHSLIEHSKPPG 
SSSRPSSGQDLFLLPSDPFVDLASGQVPLPPARRLPGENVKTNRTSQDYDQLPSCSDGSQAPARPPKPRP 
RRTAPE IHHRKPHGPEAALENVDAKI AKLMGEGYAFEEVKRALE I AQNNVEVARS I LREFAFPPPVS PRL 



Human CBL-B Protem sequence - var4 (public gi: 862409) (SEQ ID NO- 230) 

MANSMNGRNPGGRGGNPRKGRI LGI IDAI QDAVGPPKQAAADRRTVEKTWKLMDKVVRLCQNPKLQLKNS 
PPYILDILPDTYQHLRLILSKYDDNQKLAQLSENEYFKIYIDSLMKKSKRAIRLFKE 
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LEAM^KOTIDIjTCNDYI SVFEFDI FTRLFQPW6S IIJ^NWNFLAVTHPGYMAFIjTYDBVKARIjQKYSTKP 

I IVDP FDPRDEGSRCCS I IDPFGMPMLDIjDDDDDREESLMMNRIiANVRKCTDRQNS PVTS PGS S PLAQRR 
CT^D^IPHLSLPPVPPRI^LIQKGIV^^ " 

^ppiptonri^rhihh^ 
.^gsdpvLrkhrrhdlplegak^ 

PVEEDDDBYKIPSSHPVSLNSQPSHCHNV^ 
SASDPVPLPPARPPTRDNPKHGSSLNRTPSDYDLIil PPLG 

Unigene Name: CENTB1 Unigene ID: Hs . 337242 

Human CENTB1 mRNA sequence - varl (public gi: 495679) (SEQ ID NO: 37) 

GGGGTGAGAGCTCCTCCTAGGACACCCCTTTCCC 

CGGAGGTCCCTCTCCTCCTTCCCCCTCATCTCCCCTTCCTGGGACAGAAAGTGCCTCCACCTGCATCCCC 

aggggcccggcctccagggcccgctggccccacagcaggcaagctgagatgacggtcaagc 
aggagtgtctcaaggactcaccccgtttccgagcctctattgagctggtggaagccgaagtgtcagaatt 
ggagacccgtctogaaaagctcctgaaactgggcactggtctcctggaaagtgggcgccat^^ 
gccagccgcgccttcgttgtcggcatttgtgacctggcccgcctgggtccaccagagcccatgatggcgg 

agtgtctggaaaaattcaccgtgagcctgaaccacaa^ 

C(^CA(^CACTGCAGCAGCAGATCCAGACCCTGGTCAAGGAAGGTCTGCGGGGTTTCCGAGAGGCTCG 

CGGGATTTCTGGCGGGGGGCTGAGAGCCTGGAGGCTGCCCTGACCCACAACGCAGAGGTTCCCAGGCGCC 

GGGCCCAGGAGGCAGAAGAGGCAGGAGCTGCTTTGAGGACGGCTCGAGCTGGGTACCGGGGACGGGCACT 

GGATTATGCCCTGCAGATCAACGTGATTGAGGACAAGAGGAAGTTTGACATCATGGAGTTTGTGCTGCGT 

TTGGTGGAGGCCCAGGCTACCCATTTCCAGCAGGGCCATGAGGAGCTGAGCCGGCTGTCCCAGTATCGAA 

AGGAGCTGGGCGCCCAGTTGCACCAGCTGGTCTTGAATTCAGCACGAGAGAAGAGGGACATGGAGCAGAG 

AC^CGTGCTGCTGAAACAGAAGGAGCTGGGTGGGGAGGAGCCAGAACCAAGCTTAAGAGAGGGGCCTGGT 

GGCCTGGTGATGGAAGGACATCTCTTCAAACGGGCCAGCAACGCATTTAAGACCTGGAGCAGACGCTGGT 

TCACCATTCAGAGCAACCAACTGGTTTACCAGAAGAAGTACAAGGACCCTGTGACTGTGGTGGTGGATGA 

CCTTCGTCTCTGCACAGTGAAACTCTGCCCTGACTCAGAAAGGCGGTTCTGCTTTGAGGTGGTGTCCACC 

AGCAAGTCCTGCCTCCTCCAGGCTGACTCAGAGCGCCTCCTGCAGCTGTGGGTCAGTGCT 

GCATTGCTTCTGCCTTCAGTCAGGCTCGCCTTGATGACAGCCCCCGGGGT^ 

CCTGGCCATAGGCTCTGCTGCCACCCTGGGCTCTGGTGGAATGGCCAGGGGAAGGGAGCCTGGGGGAGTC 

GGGCACGTGGTGGCCCAGGTCCAGAGTGTGGATGGCAATGCCCAGTGCTGCGACT^^ 

CGGAGTGGGCCAGCATGAACCTTGGTGTCACCCTCTGCATTCAGTGTT^ 

TGTTCACTTCTCCAAAGTCCGGTCTCTGACCCTTGACTCATGGGAGCCAGAACTAGTGAAGCTCATGTGT 

GAGCTGGGAAATGTCATCATCAACCAGATCTATGAGGCCCGCGTGGAGGCCATGGCAGTGAAGAAACCAG 

GGCCCAGCTGCTCCCGGCAGGAGAAGGAGGCCTGGATTCACGCTAAATACGTGGAGAAGAAGTTCCTGAC 

CAAGCTGCCTGAGATTCGAGGGCGAAGAGGTGGCCGGGGGCGCCCAAGGGGGCAGCCTCCTGTGCCCCCA 

AAGCCTTCCATCAGGCCCCGGCCAGGGAGCTTGAGATCCAAGCCAGAGCCCCCCTCTGAGGACCTGGGAA 

GCCTGCACCCTGGGGCCCTAOTGTTTCGAGCGTCTGGGC^TCCTCCATCTCTTCCCACCATGGCTGATGC 

CCTTGCCCATGGAGCTGATGTCAACTGGGTCAATGGGGGCCAAGATAATGCCACACCGCTGATCCAGGCC 

ACAGCTGCTAATTCTCTTCTGGCCTGTGAGTTTCTCCTCC^GAACGGGGCGAACGTGAACCAAGCGGACA 

GTGCGGGCCGGGGCCCGCTGCACCACGCAACCATTCTTGGCCACACGGGGCTCGCCTGCCT 

ACGGGGAGCTGATCTGGGGGCTCGAGACTCTGAAGGCAGGGACCCTCTGACCATCGCCATGGAAACAGCC 

AACGCTGAC^TCGTCACCCTGCTACGACTGGCAAAGATGAGGG^ 

GAGATGAGACGTATCTTGACATCTTCCGCGACTTCTCCCTCATGGCGTCAGACGACCCGGAGAAGCTGAG 
CCGTCGCAGTCATGACCTCCACACGCTGTGACCCGAGGCCCACGGGGCCCGCGCCTGCCTCCCTTCCCCG 
CCACCGGGCCCTCTGCCATTAAAGCCTCCGTGCTTCGCTCTTCC 

Human CENTB1 mRNA sequence- var2 (public gi: 17391288) (SEQ ID NO: 38) 

GAGCTCCTCCTAGGACACCCCTTTCCCCTTGGGGAAAGGATTGTGCCCCCAGGCCCTTCCCCGCGGAGGT 
CCCTCTCCTCCTTCCCCCTCATCTCCCCTTCCTGGGA<^GAAAGTGCCTCCACCTGCATCCCCAGGGGCC 
CGGCCTCCAGGGCCCGCTGGCCCCACAGCAGGCAAGCTGAGATGACGGTC^^GCTGGATTTCGAGGAGTG 
TCTCAAGGACTCACCCCGTTTCCGAGCCTCTATTGAGCTGGTGGAAGCCGAAGTGTCAGAATTGGAGACC 
CGTCTGGAAAAGCTCCTGAAACTGGGCACTGGTCTCCTGGAAAGTGGGCGCCATTACCTTGCTGCCAGCC 
GCGCCTTCGTTGTCGG(^TTTGTGACGTGGCCCGCCTGGGTCC^CCAGAGCCCATGATGGCGGAGTGTCT 
GGAAAAATTCACCGTGAGCCTGAACCACAAGCTGGACAGCCATGCGGAGCTTCTAGATGCCACCCAACAC 
ACACTGCAGCAGCAGATCC^GACCCTGGTCAAGGAAGGTCTGCGGGGTTTCCGAGAGGCTCGCCGGGATT 
TCTGGCGGGGGGCTGAGAGCCTGGAGGCTGCCCTGACCCACAACGCAGAGGTTCCCAGGCGCCGGGCCCA 
GGAGGCAGAAGAGGCAGGAGCTGCTTTGAGGACGGCTCGAGCTGGGTACCGGGGACGGGCACTGGATTAT 
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GCCCTGCAGATCAACGTGATTGAGGACAAGAGGAAGTTTGACATCATGG^ 

AGGCCCAGGCTACC(^TrrCCAGCAGGGCC^TGAGGAGCTGAGCCGGCTC 

GGGCGCCCAGTTGCACCAGCTGGTCTTGAATTCAG 

CTGCTGAAACAGAAGGAGCTGGGTGGGGAGGAGCCAGAACCAAGCTTA 

TGATGGAAGGACATCTCTTCAAACGGGCCAGCAACGCATTTAAGACC^^ 

TCAGAGCAACCAACTGGTTTACCAGAAGAAGTACAAGGACCCTGTGA 

<rrCTGCACAGTGAAACTCTGCCCTGACT^^ 

CCTGCCTCCTCCAGGCTGACTCAGAGCGCCT 

TTCTGCCTTCAGTCAGGCTCGCCITGATGACAGCrc^ 

ATAGGCTCn^eiX3CCACCCTGGGerCTGGTGGAATGGCCAGGGG 

TGGTGGCCCAGGTCCAGAGTGTGGATGGCAATGCCCAGTGCTC^ 

GGCCAGCATGAACCTTGGTGTCACCCTCTGCATTCAGTO 

TTCTCCAAAGTCOGGTCTCTGACCCTTGACTCATGGGAGCCAGAACTAGTGAAGCTCATGTC 

GAAATGTGATCATCAACCAGATCTATGAGGCCCGCGTGGA 

CTGCTCCCGGCAGGAGAAGGAGGCCTGGATTCACGCTAAATACGT^ 

CCTGAGATTCGAGGGCGAAGAGGTGGCCGGGGGCGCCCAAGGGGGCAGCCTCCTGTGCCCCCAAAGCCTT 
CCATCAGGCCCCGGCCAGGGAGCTTGAGATCCAAGK^GAGAGCCCCCCTCTGAGG 

CCCTGGGGCCCTACTGTTTCGAGCGTCTGGGCATCCTCCATCTCTTCCCACCATGGCTGATGCCCTTGCC 

CATGGAGCTGATGTCAACTGGGTCAATGGGGGCCAAGATAATGCCACAC 

CTAATTCTCH^CTGGCCTGTGAGTTTCTCCTCCA^ 

CCGGGGCCCGCTGCACCACGCAACGATTCTTGGCCACACGGGGCTCGCCTC 
GCTGATCTGGGGGCTCGAGACTCTGAAGGCAGGGACCCTCTGAC 

ACATCGTCACCCTGCTACGACTGGCAAAGATGAGGGAGGCTGAAGCGGCCCAGGGGCAGGCAGGAGATGA 
GACGTATCTTGACATCTTCCGCGACTTCTCCCTCATGGCGTCAGACGACCCGGAGAAGCTGAGCCGTCGC 
AGTCATGACCTCCACACGCTGTGACCCGAGGCCCACGGGGCCCGCGCCTGCCTCCCTTCCCCGCCACCGG 
GCCCTCTGCCATTAAAGCCTCCGTGCTTCGCTCAAAAAAAAAAAAAAA 

Human CENTB1 mRNA sequence - var3 (pubHcgi:34533 0i4)(SEQIDNO:39) 

ATGTCAGCGTTGGCTGTTTCCATGGCGATGGTCAGAGGGTCCCTGCCTTCAGAGTCTCGAGCCCCCAGAT 
CAGCTCCCCGTTTCAGGAACAGGCAGGCGAGCCTGGAGAGAAGAGCCAGGGTCAGCCGGCCGCCCAA 
CTCCCAGCCTTCCTCCCCATGCCATCATCCCTACCC^ 
CCCCGGCCCGCACTGTCCGCTTGGTTC^CGTTC^ 

AATTCTGCATGGAGAAGTCGAGAAGGGGGGTTGAGGGTGG(^TCCCrAGTGGTGGATTTGAAGATGTCTT 
AGGGTGGCGCCAGTTCAGAGAATGGGAGGGTGGAGTGTGGTAATCAGGAGTGTGGAAGGGGTTACAGCTA 
ACTGTAACCAAGCrAGGCTTGGCTCTAGCTCTTTG^ 

TGAGGTATGTTACTATTTTACAGATGAGGCTGAGAGGTTAATAACTTGTTAAAAGTCTCCTGTAGGCCGG 
GCACAGTGGCTCACGCCAGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACAGGGTCAGGAGA 
TCCAGACCATCCTGGCTAGKZACGGTGG&GCCCTATCTCTACT 

TGGCTGGCGCCTGTGGTCCCAGCTACTCGGGCAGCTGAGGCAGGAGAATGGTGTGAACCCGGGAGGCGGA 

GCTTGCAGTGAGCCGAGATCGCACCATTGCACTCCGGCCTGGGGGACGGAGCGAGAC 

AAAAAAAAGTCTCCTGTAAGAGGTGAGAGCCTGGGTTCA^ 

TCTTAGCAACCAGTCTCTATTGTTGATCTCTCCCTATGGGTGGAAGCCCTAGGGAACAGGTGGTGGGGAA 
AGGAGGTAAGGGGAGGGCCCAGAGTCAGGAGTAGGTGTCAGAG 

GGGCTCTTA(^GCAGCTGTGGCCTGGATCAGCGGTGTGGCATTATCTTGGCCCCCATTGACCCAGTTGAC 

ATCAGCTCCATGGGCAAGGGCATCAGCCATGGTGGGAAGAGATGGAGGATGCCCAGACGCTCGAAACAGT 

AGGGCCCCAGGGTGCAGGCTTCCCAGGTCCTCAGAGGGGGGCTCTGTTCGGGGGATTTGGTTCTGTTAGG 

GGGAAGCAGCTCCGAGTCTGGGAAGAAAACCCTCAGCAGTGTCCCAATGCTATAATGGGA 

CTAAATGATGGGGAGCTTGGGACTGTGGAGGGAATAGAGTGATGCAAGTGTGGGTATGTGTAAGTATGCG 

TATGCATGTGTACX3AGTCCCTAGGGTGTGGGGGAGAGACGGCA 

TTGGCCTC^GCTCTC^CCCCTGACGCTC^^ 

TTGGCACCCCTTACCCTCCCTGCCCACGCCCAGCCC 

CGTAACCCCTGAAACGGCAGTCCGGTCCCTCCGACATTGTCCAGC 

GCCTCCCGGCGCTACCTGGCACGATGCCGAGC^CACAGCAGATGCTCAATGAATGCCCGACCAACCCTAT 
ACCTGGCTTGGATCTCAAGCTCCCTGGCCGGGGCCTGATGGAAGGCTTTGGGGGCACAGGAGGCTGCCCC 
CTTGGGCGCCCCCGGCCACCTCTTCGCCCTCGAATCTCAGGCAGCTTGGTCAGGAACTTCTTCTCCACGT 
ATTTAGCGTGAATCCAGGCCTCCTTCTCCTGCCTGTGGGAGGG 

CTCCAGGGGTCCCCCATTCCCCTGGGAGGCTAAACCCCAAGCTCACCGGGAGCAGCTGGGCCCTGGTTTC 

TTCACTGCCATGGCCTCmCGCGGGCCTCATAGATCTGGTTGATGATGACATTTCCCAGCTC^CACATGA 

GCTTCAGGAGGCCCAGGCAGAGGCAGAGACAGGGAAGGTGGGGGTGAGTGACTCCTCAGGGATCACGCCC 

CTGCACCGCCATGTCCTTGCCCCACCCCAAGTTCTTGCCCCCAATCTTCACAATACGCTAAGTTACCT 

ACTAGTTCTGGCTCCCATGAGTC^GGGTCAGAGACCGGACTTTGGAGAAGTGAACACCAAGGCTCCT^ 

AGGGCCAGAGGGGGAGGGTCAGGCCCTGTGCAGGGGGGCAGTGGCCTGGGGAGCTGCTGCTGCTCCTGAA 

GACACTGGGAGGCAAGGCTGGCATGGGGGCCCGTGCAGAGGTGCTGGCCCAGGAGGCAGGGCAGCTGCGG 
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CCATGTAACCGCCATGTAGCCITGACCTGGCCC^ 

AGGTTTCATTTCAAGGCCCTCATCACTC 

TGGACAACCTCTCCCATCTGACCTCC^ 

TAAGGCCTCATGACCGTTTCTCTGCCTAGTC^^ 

CTTCTTCCAACCAAGCCAACAAATTCT^ 

GGCCCGCCTGGGTCCAACTGGCACCCCAGAGGCIT^ 

GCTAAX3AGAGGAAGCCAGGACAGGGAAGGAAGAGGGGCCGG 

GTGGCCCGAAGGAGTGGGGCTCCCGCATGGGCCITGTGGAG 

AGAGGGTGACACCAAGGTTGATGCTGGCCC^CTCCGGGGCTGGCTCCCGGCAGT 

GCCATCCACACTCTGGACCTGGGCCACCAC^ 

CCAGAGCCCAGGGTGGCAGGAGAGCCTATGGCCAGGTGT^ 

GGTGGCCAAGGGGCCTAGGGTAAAGGGTGCCCCATCT^ 

AGGTACCTGGCCTGGACCCCGGGGGCTGTCATCAAGGCGA 

TGCACAGCACTGACCCACAGCTGCAGGAGGCXjCTCTGAGTCAGCCTGGAGGAGGC^ 

GGGTGAGGGAGATGGCAGAGGGGTCTGAGGCCTGGGAAGCAAAGTO^ 

GCCAGTATTC^CCAGTTCCAGTTGCATTGAAAGACTTCTGTACCAGTT 

CCCCATCACCCTGTACCCTCTTCCAC^TGGCCCCCCAGTCCAGCCGCCAAAGAATTAAATTAAAG 

GAGCTGCATGGGGGGCTTCC^TTGTGGTGGGCCCTGCCTTTCAGATTGGCAGTTGTTTAGATATATTAGA 

GTATCACCCCHX2KSGGATTGCACTCAC 

AGGGCAGAGTTTCACTGTGCAGAGACGAAGGTGATCCA^ 

GTGATAAGGGGCCAGATGTCCAGCTGCAGGCAAGAGCTGAGTCT 

AGGTCCACTCACCTTGTACTTCTTCTGGTAAACC^^ 

GAGAAGGAAATGATTACAGACATAGGCAGCTTTAGG^ 

ATGAGGATCTTACTGAGAGGAGAGACACCTGGGCTGACT 

AATTGTGCAGATTGGAACATCTAAAGGATGCTATTCCTATCTTGGACAACCCAGATTTCATATAGTTATG 
AAGACAACTTTCCAGCAGATGGCAGTAAAATTCTTTTTCTAATAAAATGTCTATTGCTA 
ATACTATTTAGGCTGGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGATGGGGGTGGTGGATCGCC 
CGAGGTCAGGAGTTTGAGACC^CCCTGACCAATATC^ 

AGCCAGGCGTGGTGGGAGGCGGCTATAATCCCACCTACTTGGGAGGCTGAGGCGGGAGAATCGCTTGAAC 
CGAGGAAGCTGAGGTTGCAGTGAGCTGGGATCGCACCACTGTGCTGCAGCCTGCGCAACATAGCGAGGCT 
CCATGAAAAAAGAAAAAAAAAAGAAAAAGAAAAAAAGAAAAGAAAGAA 
CTCACGCCTGTAGTCCCAGCAAGTTGGGAGGCCGAGGCG 

CAGCCTGGGCAACATGGTGAAACCCTGTTTCTACCAAAAATACAAAAATTAGCCG 

GCCTGTGGTCCCAGCTGTTTAGGATGCTGAGGAGGGAGGATCACTTC 

TGAGCCGAGACTGCGCCACTGCACTGCAGGCTGGGCAACAGA 

Human CENTB1 mRNA sequence - var4 (public gi: 32879918) (SEQ ID NO: 40) 

ATGACGGTCAAGCTGGATTTCGAGGAGTGTCTCAAGGACTCACCCCGTTTCCGAGCCTCTATTGAGCTGG 
TGGAAGCCGAAGTGTCAGAATTGGAGACCCGTCrGGAAAAGCTCCTGAAACTGGGCACTGGTCTCCTGGA 
AAGTGGGCGCCATTACCTTGCTGCCAGCCGCGCCTTCGTTGTCGGCATTTGTGACCTGGCCCGCCTGGGT 
CCACCAGAGCCCATGATGGCGGAGTGTCTGGAAAAATTCACCGTGAGCCTGAACCACAAGCTGGACAGCC 
ATGCGGAGCTTCTAGATGCCACCCAACACACACTGCAGCAGCAGATCCAGACCCTG 

GCGGGGTTTCCGAGAGGCTCGCCGGGATTTCTGGCGGGGGGCTGAGAGCCTGGAGGCTGCCCTGACCCAC 

AACGCAGAGGTTCCCAGGCGCCGGGCC<^GGAGGCAGAAGAGGCAGGAGCTGCTTTGAGGACGGCTCGAG 

CTGGGTACCGGGGACGGGCACTGGATOATGCCCTGCAGATCAACGTGATTGAGGACAAGAGGAAGTTTGA 

CATCATGGAGTTTGTGCTGCGTTTGGTGGAGGCCCAGGCTACCCATTTCCAGCAGGGCCATGAGGAGCTG 

AGCCGGCTGTCCCAGTATCGAAAGGAGCTGGGCGCCCAGTTGCACCAGCTGGTCTTGAATTCAGCA 

AGAAGAGGGACATGGAGCAGAGACACGTGCTGCTGAAACAGAAGGAGCTGGGTGGGGAGGAGCCAG^CC 

AAGCTTAAGAGAGGGGCCTGGTGGCCTGGTGATGGAAGGACATCTCTTCAAACGGGCCAGCAACGCATTT 

AAGACCTGGAGCAGACGCTGGTTCACCATTCAGAGCAACCAACTGGTTTACCAGAAGAAGTACAAGGACC 

CTGTGACTGTGGTGGTGGATGACCIT(X3TCTCrGCAG^^ 

CTGCTTTGAGGTGGTGTCCACCAGCAAGTCCTGCCTC 

TGGGTCAGTGCTGTGCAGAGCAGCATTGCTTCTGCCTT 

GTCCAGGCCAGGGCTCAGGACACCTGGCCATAGGCTCTGCTGCCACCCTGGGCTCTGGTGGAATGGCCAG 
GGGAAGGGAGCCTGGGGGAGTCGGGCACGTGGTGGCCCAGGTCCAGAGTGTGGATGGCAATGCCCAGTGC 
TGCX3ACTGCCGGGAGCCAGCCCCGGAGTGGGCCAGCATCAACCTTGGTGTCACC 

CCGGCATCCACAGGAGCCTTGGTGTTCACTTCTCGAAAGTCCGGTCTCTGACCCTTGACTCATGGGAGCC 
AGAACTAGTGAAGCTCATGTGTGAGCTGGGAAATGTCATCATCAACCAGATCTATGAGGCCCGCGTGGAG 
GCCATGGCAGTGAAGAAACCAGGGCCCAGCTGCrCCa^CAGGAGAAGGAGGCCTGGATTCACGCTAAAT 
ACGTGGAGAAGAAGTTCCTGACCAAGCTGCCTGAGATTCGAGGGCGAAGAGGTGGCCGGGGGCGCCCAAG 
GGGGCAGCCTCCTGTGCCCCCAAAGCCTTCCATCAGGCCCCGGCCAGGGAGCTTGAGATCCAAGCCAGAG 
CCCCCCTCTGAGGACCTGGGAAGCCTGCACCCTGGGGCCCTACTGTTTCGAGCGTCTGGGCATCCTCCAT 

ctcttccc^ccatggctgatgcccttgcccatggagct^^ 
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TGCCACACCGCTGATCCAGGCCACAGCTGCTAATTCTCT 
GCGAACGTGAACCAAGCGGACAGTGCGGGCOSGGGCC^ 
GGCTCXSCCTGCCTOTTCCnSAAACGGGGA 
GACCATCGCCATGGAAAGAGCCAACGCTGAGATTO 

GAAGCGGCCCAGGGGCAGGCAGGAGATGAGACGTATCTTGACATCTTCCX^ 
CAGAC^CCCGGAGAAGCTGAGCCGTCGCAGTCATGACCTCCACACGCTGTAG 

Human CENTB1 protein sequence - varl (public gi: 32879919) (SEQ ID NO: 231) 

mVKLDFEECLKDSPRFRASIELVEAEVSELET^ 
PPEPMMAECLEKFTVSIrNHKLDSHAELIiDATQH^ 

NAEVPRRRAQEAEEAGAALRTARAGYRGRALDYAIiQINVI EDKRKFDI MEFVLRLVEAQATHFQQGHEEL 

SRItSQYRKELGAQLHQIiVIjNSAREKRDMEQRHVLLKQKELGGEEPEPSLRE 

KTWSRRWFTIQSNQLVYQKKYKDPVTVVTO^ 

WSAVQSSIASAFSQAIUiDDSPRGPGQGSGHIiAIGSAATIjGSGGMARGREPGGVGHWAQVQSVDGNAQ 
mCREPAPEWASIlSrLGVTLCIQCSGIHRSIXjVHFSKVRSLTIJDSW 
AMAVKKPGPSCSRQEKEAWIHAKYVEKKFLTKLPEIRGRR 
PPSEDU3SI^PGALLFRASGHPPSLPTMADALAHGADV^^ 

ANVNQADSAGRGPLHHAT I IjGHTG3jACLFLKRGADLGARDSEGRDPLTI AMETANADI VTLLRIiAKMREA 
EAAQGQAGDETYIiDIFRDFSIiMASDDPEKLSRRSHDLHTL 

Human CENTB1 protein sequence - var2 (public gi: 34533015) (SEQ ID NO: 232) 

MSAIAVSMAMTOGSLPSESRAPRSAPRFRNRQASLERRARVSR^ 

PRPALSAWFTFAPFWRRNSQARRE FCMEKSRRGVEGGI PSGGFQDVLGWRQFREWEGGVW 

Human CENTB1 pray sequence - varl (SEQ ID NO: 41) 

GCCTGGAGTACCCATACGACGTACCAGATTAra 

CAGTGGTATCAACGCAGAGTGGCCATTATGGCCGGGGAAGGAGGCCTGGATTCACGCTAAATACGTGGAG 
AAGAAGTTCCTGACCAAGCTGCCTGAGATTCGAGGGCGAAGAGGTGGCCGGGGGCGCCCAAGGGGGCAGC 
CTCCTGTGCCCCGAAAGCCTTCCATCAGGCCCCGGC 
TGAGGACCTGGGAAGCCTGCACCCTGGGGCCCTACTGTT^ 

ACCATGTCGGCCGCCTCGGCCTCTAGAGGGTGGGCATCGATACGGGATCCATCGAGCTCGAGCTGCAGAT 
GAATCGTAGATACTGAAAAACCCCGCAAGTTCACTTCAACTGTGCATTCGTGC 

Human CENTB1 pray sequence - var2 (SEQ ID NO: 42) - 

CCGGCATGAGTACCATAACGACGTACAGATTACAGCGTNCATATAGTGACCATGGAGGCAGTGAATTCCA 
CCGCAAGCAGTGGTATCAACGTCATGAGATGGACCATTATGAGCCGGGGTGGGCAGCCTCCTGATGTCCC 
CGCGAAAGGCCTTCGATCAGGCNCCXSGCAGAGGCAGCTTGAGATCCAAGCCAAGAGCCCCCCTCTGAGG^ 
CCTGGGTAAGACCTGCTACCACTAGTGCGGCCCTrACTGTTN^ 

CAACCGATGGNCTGATGCCCTTTGGCGCC^TGGTAGCTTGA.TGTCAACCTAGGTGTACAANTGTGAGTGG 
CCTNAAGGATAAATTGCTCGTCGACC^GACCGGCTATCCA^GGCACAATAATCTAGCTAATTCGTTACG 
TTCTTGG 

Human CENTB1 pray sequence - var3 (SEQ ID NO: 43) 

CCTGGAGTACCCATACGACGTACCAGATTACGCTCATATGGCCATGGAGGCCAGTGAATTCCACCCAAGC 
AGTGGTATCAACGCAGAGTGGCCATTATGGCCGGGGGGGCAGCCTCCTGTGCCCCCAAAGCCTTCCATCA 
GGCCCCGGCC^GGGAGCTTGAGATCCAAGCC^GAGCCCCCCTCTGAGGACCTGGGAAGCCTGCACCCTGG 
GGCCCTTVCTGTTTCGAGCGTCTGGGCATC^ 

GCTGATGTCAACTGGGTCAATGGGGGCCAAGATAATGCCACACCGCCGATCCAGGCCACAGCTGCTAATT 
CTCTTCTGGCCTGTGAGTTTI^GCTCCAGAACGGGGCGAACGTGA^^ 

CCCGCTGCACC^CGCAACCATTCTTGGCCACACGGGGCTCGCCTGCCTGTTCCTGAAACGGGGAGCGGAT 
CTGGGGGCTCGAGACTCTGAAGGCAGGGACCCTCTGACGATCGCCATGGAAACAGCCAACGCTG^ 
TCACCCTGCTACGACTGGCAAAGATGAGGGAGGCTGAAGCGGCCCAGGGGCAGGCAGGAGATGAGACGTA 
TCTTGACATCTTCCGCGACTTCTCCCTCATGGC^ 

GACCTCCACACGCTGTGACCCGAGGCCCACGGGGCCCGCGCCTGCCTTC^TTTCCCGNCACCGGGCCCTT 
TGNCATNAAAGCCTNCGNGCTTCNAAAAAAAAAAAAAAAAAAAA 

Hitman CENTB1 pray sequence - var4 (SEQ ID NO: 44) 

CCGGCCATGGAGTACGATACGACGTACAGTATTACAGCTACATATGGCCATGGAGGCCAGTGAATTCCAC 
CGCAATGCAGTGGTATCAACGCATGCAGATGGACCATTATGGCCGGGGTGGGCACGTCCGTCCATGATGT 
CCCCCAAAGGCCTTCCATCAGGCCCCGTGGCAGAGGAGGCTTGAGATCCAAGCCAGAGCCCACCCTCTGA 
GGACCTGGGAAGCCTGCACCCNGGCGGC<rCTACTGTTTCGAGCGTCTGGGACATACTCC^TCATCTTCCC 
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ACGCGATGGACTGATGCCCTTGGGCCAATGGACra2TGATC 

GATTAACTGCTCATCACCCGATGATCCATGGCCACTAGTCT^ 

CTCCTCACAGAAACGGTGCGCTGCAATCGTGAACNCAAAGCG<^ 

TTGCACCGATCGCAAGCCAATTCITGGCCANCT 

GCTGATCTGGGGCGCTCGACGACTCTGAAG 

Human CENTB1 pray sequence- var5 (SEQ ID NO: 45) 

GCCATGGATACCATACGACGTAGAGATTACGCTCA^ 

TGGTATCAACGCATGAGATGGTCATTATGGCCGGGGCAGGAGAA^ 

TGGAGAAGAAGTTCCTGACCAAGCTGCCGTGAGAATTCGAGGGCGAA 

GGGCAGCCTCCTGTGCCAGCCCTAAAGCCTTCCATCATGGCCCCGCG 

CCGAGTAGGCCCCCTCTGACGGACCTAGGGAAGC^ 

TGGGCATCCTCCTVTCTCTTTCCACCATGGCCTGATGCCCT^^ 

TGGGTGGCCAAGATATATGCGAC^CCGCTGATCC^^ 

NTNTTCTCCTCCAGAACGGTGGCGACACGTGAACCCAAGCGGNCAGTGCCGGC 

Human CENTB1 pray sequence - var6 (SEQ ID NO: 46) 

GGCCATGGAGTACCATACGACGTACAGATTACGCTCATATGGCC^^ 
GCAGTGGTATCAACGCATGAGATGGACCATTATGGGGGGCAGT 

CCAGCIXjCTCCCGGCAGGAGAAGGAGGCCTGGATTCACGGCATAATAGTAGCAGCT^ 

CTGTATCCAAGTCTGCCCTGACGAATTCGAGGTGGCGAAGTATGGTGGCCGGGGGGAGTCCTCGAAGGAG 

GGTCAGCCACTCCTGGTGCCGCC^CGAACATGCCCGT^ 

ATTGAGATCCACATGCACAGAGCCCCGCCTCTGAGGACCTGTGAGGAAGCrCATGGCAACCCT 

CTAGCGTAGTATTCTGAGCCAGTCTGGGCAATCGCTTCACATCTCTTCTCCACGCATGAGCATGATGCGC 

GCTTTGACCCATGGAGCTAGATGTCAACTGGGTCAATGGGGTGCCAAGATAATCGCCACACC 

CAAGGCCTACAGCTGCTAACGTTCCTCTTCTGGCCTGTGAGTTTCTCCTCTCAGAACGGGGCGAACTGTC 

AAGCCC^GCGTGACAGTGCGGGCCCGGGGCCCGACTGCGCCACGCAATCCACTTCTTGGCCNGCAACOT 

GGGCTCGNCTTGCCCTGTTTCCTTGATCAC 

Human CENTB1 pray sequence - var7 (SEQ ID NO: 47) 

CNCGGCATGGAGTACCATACGACGTACAGATTACGCTCATATGGCCATGGAGGCAGTGAATTCCACCCAA 

GGAGTGGTATGAACGCA/TGAGTGGACCATTATG^ 

TCAACCAAGATCTATGAGGCCCGCGTGGAGGCCATGGCAGTG^ 

CAGGAGAAGGAGGCCTGGATTCACGCTAAATACGTGGAAGAAGAAGTTCCTGACCAAGCTGCCTGAGATT 
CGATGGCGANGAGGTGGCCGGGGGCGCCCAANGGGGCAGNCTCCTGTGCCCCCAAAGCCTTCCATCAGGC 
CCCAGGCGCAGGGAGCTTGAGLATCCAATGCCAGAGCCCCCCGTCTGAGGACCTGGGAAGCCTGCACCCTG 
GGGCCCTACTGTTTCGAGCGTCTGGGC^TCCTCCATCTCTTCCGACCATGGCTGATGCCCTTGCCCATGG 
AGCTGATGTCAACTGGGTCAATGNGGCGGCCAAGATAATGCCATCACCGACTGATCCAGGCCACAGCCTG 
CTAANTTCTACTTCTGGCCGTGTGAGTTTCTCCTCCAGGAACGGGGCGAACCGTGGACCAAGGCGGACNN 
GTGCGGGCCGGGGCCCGCTGCCACCACGCCAACCATTCTTGGCATACGGGGCTCGCCT 

Unigene Name; DDEF1 Unigene ID: Hs. 386779 

Human DDEF1 mRNA sequence - varl (public gi: 31873727) (SEQ ID NO: 48) 

GAGACAAAGTTTACAAAAATTGAGAAAGAGAAAAGAGAGCACGCAAAACAACATGGGATGATC 
AGATAACAGGAGCTGAGATTGCGGAAGAAATGGAGAAGGAAAGGCGCCTCTTTCAGCTCCAAATGTGTGA 
ATATCTCATTAAAGTTAATGAAATCAAGACGAAAAAGGGTGTGGATCTGCTGCAGAATCTTATAAAGTAT 
TACCATGCAC^GTGCAATTTCTTTCAAGATGGCTTC 

AACTGGCTGCTGATTTATATAATATAAAACAGACCCAGGATGAAGAAAAGAAACAGCTAACTGCACTCCG 

AGACTTAATAAAATCCTCTCTTCAACTGGATCAGAAAGAAGATTCTCAGAGCCGGCAGGGAGGATACAGC 

ATGCATCAGCTCCAGGGCAATAAGGAATATGGCAGTGAAAAGAAGGGGTACCTGCTAAAGAAAAGTGACG 

GGATCCGGAAAGTATGGGAGAGGAGGAAGTGTTCAGTGAAGAATC 

ATCTAACAGGGAACCAGCCAAGTTGAACCTTCTCAC 

TCTTTTGACCTGATATCACATAATAGAACATA 

GGATATCAGTATTGACAAATAGCAAAGAAGAGGCCCTAACCATGGCCTTCCGTGGAGAGCAGAGTGCGGG 
AGAGAACAGCCTGGAAGAC CTGACAAAAGCCATTATTGAGGATGT CCAGCGG CTCC CAGGGAATGACATT 
TGCTGCGATTGTGGOTCATCAGAACCCACCTGGCTTTCAACCAACTTGGGTAT 

GTTCTGGCATCCATAGGGAAATGGGGGTTCATATCTCTCGCATTC^GTCTTTGGAACTAGACAAATTAGG 
AACTTCTGAACTCHTGCTGGCCAAGAATGTAGGAAACAATAGT^ 
CCC^GCCCCTC^CCAAAACCCACCCCTTCAAGTGATATGACTGTAC^ 
ATGTAGATCATAGGTTTTCAAGGAAGACCTGTTCAAC^ 
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.CATCAAATCCAGGGATTTACTTGCACTAA 
CTGGAACCTGGGCAGGAGCTTGGGGAGACAGCCCTTCACC^ 
TCCATTTGGTTGACTTCCTTGTACAAAACTGTGGGA 
TCTACACTACTGTAGTATGTACAGTAAACCTGAGTGTTT^ 

GATATAGTTAACCAGGCTGGAGAAACTGCCCTAGACATAGCAAAGAGACTAAAAGCT 

ATCTOCTTTCCCAGGCTAAATCTGGAAAGTTCAATCCACACGTC 

ACAGGAGGAGATAGATGAGAGCGATGATGATCTGGATGACAAACCAAGCCCT 

CCC^GACCTCAGAGCTTCTGCCACTCCTCCAGCATCTCCCCCCAGGACAAGCTGGC^ 

GCACTCCAAGGGACAAAXIAGCGGCTCTCCT 

AGACTCGCCC^CATCACCAACCACGGAGGCTCC^ 

GGCCCACCTTCAACACTCCCTCTAAGCACCCAGACCTCTAGTGGCAGCTCCACC 

CTCCTCCCCCACCACCCGGAmO^GAGAACCCTATCCGACCCTCCCAGCCCACTACCT 

AAACAAAGGCGCAGTTCCTTGGGGTAACGATGGGGGTC 

GAGGGACTATCCCAGCAGTCGAGCACCAGTTCTGCAAAGACTC 

TACCTCAGAAAGTGGCACTAAGGAAAACAGATCATCTCT 

CTTTCAGAAATCATCACAGTTGGCAGAGTTGCCACAAAAGC 

ACAGAACTGGCCCCCAAGCCCCAAATTGGAGATTTGCCGCCTAAGCra 

AGCTGGGGGACCTGCCACCCAAACCCCAACTCTCA^ 

CCCCAAACCACAGCTGGGAGACCTGCTAGCAAAATCC^ 

CCCTCTGAGGTC^CACTGAAGTCACACCCATTGGATCT 

AAAAGCAAGCATCTGAAGACTCCAACGACCTC^CGCCTACTCTGCCAGAGACGCCCGTACCACTGCCCA^ 

AAAAATCAATACGGGGAAAAATAAAGTGAGGCGAGTGAAGACCATTTATGACTGCC^ 

GACX3AGCTCAGATTCATCGAGGGAGAAGTGATTA 

•GCCACATCGAAGGACAGCCTGAAAGGAAGGGGGTCTTTCCAGTGTCCTTTGTTCA 
GCAAAACGCA.GAACCTTAAGATTGTCCAGATCCTTCATGCAAGACTGC 

CAGTGTGTATATAGCTGCTGTTACAGAGTAAGAAACTCATGGAAGGGCCACCTCAGGAGGGGGATATAAT 

GTGTGTTGTAAATATCCTGTGGTTTTCTGCCTTCACCAGTATGAGGGTAGCCTCGGACCCGGCGCGCCTT 

ACTGGTTTGCCAAAGCCATCCTTGGCATCTAGCACT^ 

CAAAAATAGGAGTATAGGAACTGCTGGCTTTGCAAATAGAAGT 

GAATTGACTCTGTTCCTAACAATGCAGTATTCTCAATTGTGTTACT 

TGGGTTCTGTTTTCCAGGTGAAACTTTTAGCTCCATGACAGACCAGCCTGTAGTTATOTGTGTAC^ 

TTACAGCTACAAAAACCTACTTTGGTATTTATTACAGAAAAGTGCTCAGTTAAATGTAAGTGTT^ 

TTCAGCAAAATATTCACTGACCCAAAACTCTTTAT 

GACAAGTGGATTTATACTGTCTTATGAGTGCCCGCCCCTGATATATTACCTCATTATGCAA/^AATAACAT 

ATCTTTCATGACTATTTTGACAAAAGTTTAAAACACATATGAAGTTCAAATTTCAGGAACC^ 

CAGAAAATATTAGCCTCTAGATTACGCATGCATTTAGAAAC 

TAGTATGGATAAGTGGAATTGTACATTTTTTAAACTTGATTGCCATTAAAGCAGAAATTA 
AC^TATTTGTTTCrAATCACTGGCTTTCTCAAGAGTATGGATTGACATATTGTGOT 
TCTCAGATGTGTTGAAGCATCGATTGCATCCATTTTTTATTATTTTCTTAGTTTTGTO 
. TAAACTTTTAAAAGATTATTCAAGATGAATTTAAAAGTCAACCCTTCACACAGTTTCCCTACrGTATGTA 
GAATCC^GGTGCTGAAACCAAGTGTTTCTTTTCCCATGCTCTTTGTTAAACCCCAATO 
TCCAGTCTTAAGCTCTGTCCACCTTCAAGTGAATTCATAACCAAGTTTTTGAACGCTGCT 
CTGTGAAAAGCACTCTTCCCTCTCAGTTTTCTTT^ 
TGTATTTGAGTCTTTTACATCAGTCTGA^ 

AACTGTTCCCTGAGTCCGAGGCTCATGTGTCATTCTGGCACTACATTTGCTTAAATTGCTATTTTGGCAA 

CAGCACAGAAAACTAATATTTTTAAGCAGAGAATCTTGGCAATG 

CACAACTCCCAACCCAACCCTTAGGAAAAGCCCTCT 

GTTOTGCTTAAGTGGTTGCTATACAAACTTTGJ^TAGCCACCTAATAAATAAACCTTG 

GCAAAATATTTTATCAGCTGTTATTGGAAAGTGA^ 

TGAATTTCGACACCAAACAGAGCATG^^ 

CCTCGGTACTGAAGCCACACCAGTGTCCGGATGGAAGTCTGCATCTGAGGTTGCTCAGTGTCCCGGTCAT 

TCATTTACACATTTTAACTTGCATTAAAGAGCTGTTCTTTTCTGTGGCCTAGA 

AAAATAAACTGGTTTTTTTCAAAAAAAAAAAAAAAAAAAA 

Human DDKF1 raRNA sequence - var2 (public gi: 633 0853) (SEQ ID NO: 49) 

GAAAAGAGAGCACGCAAAACAAC^TGGGATGATCCGCACAGAGATAACAGGAGCTGAGATTGCGGAAGAA 

ATGGAGAAGGAAAGGCGCCTC^TCAGCTCCAAATGTGTGAATATCTCATTAAAGTTAATGA 

CCAAAAAGGGTGTGGATCTGCTGCAGAATCTTATA^ 

TGGCTTGAAAACAGCTGATAAGTTGAAA^GTACATTGAAAAACTGGCTGCTGATTTATATAA 

CAGACCCAGGATGAAGAAAAGAAACAGCTAACTGC^CTCCGAGACTTAATAAAATCCTCTCTTCAACTGG 

ATCAGAAAGAATCTAGGAGAGATTCTC^GAGCCGGCAAGGAGGAT^ 

TAAGGAATATGGCAGTGAAAAGAAGGGGTACCTGCTAAAGAAAAGTGACGGGATCCGGAAAGTATGGCAG 
AGGAGGAAGTGTTC^GTCAAGAATGGGATTCTAACCATCTCACATGCCACATCTAAC^ 
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AGTTGAACCITCTCACCTGCCAAGTAAAACCTAATG 
TAATAGAACATATGACTTTCAGGCAGAAGATGAGCAGGATTAT^ 
AGCAAAGAAGAGGCCCTAAOCATGGCCTTCCXSTGGAGAGCAGAGTGC^ 
TGACAAAAGCCATTATTGAGGATGTCCAGCGGOTCCCAGGGAA 

AGAACCCACCTGGCTTTCAACCAACTTGGGTATTTTGACCTGTATAGAATGTTCTGG 
ATGGGGGTTCATATTTCTCGCATTCAGTCTTTGGAAC^^ 

CCAAGAATGTAGGAAACAATAGTTTTAATGATATTATGGAAGCAAATTTACCCAGCCCCT 
CACCCCTTCAAGTGATATGACTGTACGAAAAGAATATATCACTGC 
AGGAAGACCTGTTCAAOTTCATCAGCTAAACTAAATGAATTGCTTGAGGCCAT 
TTGCACTAATTCAAGTCTATGCAGAAGGGGTAGAGCTAAT 

TGGGGAGACAGCCCTTCACCTTGCCGTCCGAACTGCAGATCAGACATCTCTCCATTTC 
GTACAAAACTGTGGGAACCTGGATAAGCAGACX3GCCCTGGGAAACACAGTTCTACACTACT 
ACAGTAAACCTGAGTGTTTGAAGCTTTTGCTCAGGAGCAAGCCCA 
AGAAACTGCCCTAGACATAGCAAAGAGACTAAAAGCTACCCAGTGTGAAGAT 

TCTGGAAAGTTCAATCCACACGTCCACGTAGAATATGAGTGGAATCTTCGACAGGAGGAGATAGATGAGA 
GCGATGATGATCTGGATGACAAACGAAGCCCTATCAAGA^ 

CCACTCCTCCAGCATCTCCCCCCAGGACAAGCTGGCACTGCCAGGATTCAGCACTCCAAG 

CGGCTCTCCTATGGAGCCTTCACCAACCAGATCTTCGTTTC 

CCACGGAGGCTCCCCCTCTGCCTCCTAGGAACGCCGGGAAAGGTCCAACTG 

TCTAAGCACCCAGACCTCTAGTGGCAGCTCGACCCTATCCAAGAAGAGGCCTCCTCCCCCACCACCCGGA 
CACAAGAGAACCCTATCCGACCCTCCCAGCCCACTACCTCATGGGCCCCCAAACAAAGGCGCAGTTCCTT 
GGGGTAACGATGGGGGTCCATCCTCTTCAAGTAAGACTACAAACAAGTTTGAGGGACTATCCCAGCAGTC 
GAGCACCAGTTCTGCAAAGACTGCCCTTGGCCCAAGAGTTCTTCCTAAACTACCTCAGAAAGTGGCACTA 
AGGAAAACAGATCATCTCTCCCTAGACAAAGCCACCATCCCGCCCGAAATCTTTCAGAAATCATCACAG^ 
TGGCAGAGTTGCCACAAAAGCCACCACCTGGAGACCTGCCCCCAAAGCCCACAGAACTGGCCCCCAAGCC 
CCAAATTGGAGATTTGCCGCCTAAGCCAGGAGAACTGCCCCCCAAACCACAGCTGGGGGACCTGCCACCC 
AAACCCCAACTCTCAGACTTACCTCCCAAACCACAGATGAAGGACCTC 

ACCTGCTAGCAAAAT.CCCAGACTGGAGATGTCTCACCCAAGGCTCAGCAACCCTCTGAGGTCACACTGAA 
GTCACACCCATTGGATCTATCCCGAAATGTGCAGTCCAGAGACGCCATCCAAAAGCAAGCATCTGAAGAC 
TCCAACGACCT(^CGCCTACTCTGCCAGAGACGCCCGTACCACTGCCCAGAAAAATCAATACGGGGAAAA 
ATAAAGTGAGGCGAGTGAAGACCATTTATGACTGCCAGGCAGACAACGATGACGAGCTCACATTCATCGA 
GGGAGAAGTGATTATCGTCACAGGGGAAGAGGACCAGGAGTGGTGGATTGGCCACATCGAAGGACAGCCT 
GAAAGGAAGGGGGTCTTTCCAGTGTCCTTTGTTCATATCCTGTCTGACTAGCAAAACGCAGAACCTTAAG 
ATTGTCCACATCCTTCATGCAAGACTGCTGCCTTCATGTAACCCTGGGCACAGTGTGTATATAGCTGCTG 
TTACAGAGTAAGAAACTCATGGAAGGGCCACCTCAGGAGGGGGATATAATGTGTGTTGTAAATATCCTGT 
GGTTTTCTGCCTTCACCAGTATGAGGGTAGCCTCGGACCCGGCGCGCCTTACTGGTTTGCCAAAGCCATC 
CTTGGCATCTAGCACTTACATCTCTCTATGCTGTTCTACAAGC?UUVCAAACAAAAATAGGAGTATAGGAA 
CTGCTGGCTTTGCAAATAGAAGTGGTCTCCAGCAACCGTTGAAAGGCATAGAATTGACTCTGTTCCTAAC 
AATGCAGTATTCTGAATTGTGTTACTGAAAATGCAACATTAGCAAAGAGGTGGGTTCTGTTTTCCAGGTG 
AAACTTTTAGCTCCATGACAGACGAGCCTGTAGTTATCTGTGTACACAGTTTACAGCTACAAAAACCTAC 
TTTGGTATTTATTAC^GAAAAGTGCTCAGTTAAATGTAAGTGTTATTCCTTCAGGAAAATATTCACTGAC 
CCAAAACTCTTTATGGCATTTTACAATGCACAGAGCCTCATGCAAGTTTAGACAAGTGGATTTATACTGT 
CTTATGAGTGCCCGCCCCTGATATATTACCTCATTATGCAAAAATAACATATCTTTC^TGACTATTTTGA 
CAAAAGTTTAAAAC^CATATGAAGTTCAAATTTCAGGAACCAAGGACTGCCAGAAAATATTAGCCTCTAC 
ATTACGCATGCATTTAGAAGCTTACCTGAAATCTGCCTTTTATAAAGGAATAGTATGGATAAGTGGAATT 
GTACATTTTTTAAACTTGATTGCCATTAAAGCAGAAATTATAAGGTTC 

CTOGCTTTCTCAAGAGTATGGATTGACATATTGTGTTATGAATGCACATCTCTCAGATGTGTTGAAGCAT 

CCATTGCATCCA.TTTTXTATTATTTTCTTAGTTTTGTTCTTGGACAAATTTAAACTTTTAAAAGATTAT 

CAAGATGAATTTAAAAGTCAACCCTTCACACAGTTTCCCTACTGTATGTAGAATCCAGGTGCTGAAACC^ 

AGTGTTTCTTTTCCCATGCTCTTTGTTAAACCCCAATTATAGATAATTTTTCCAGTCTTAAGCTCTG 

ACCTTCAAGTCAATTCATAACCAAGTTTTTGAACGCTGCTATGAATTGCACTGTGAAAAGCACTCTTCCC 

TCTCAGTTTTCTTTTCATCCCAGCCATGTTTAT^ 

CAGTCTGAATTTTGGAAAAGAATGCAATAGTTGTACTCCACAGTCAGTGGAACTGTTCCCTGAGTCCGAG 
GCTCATGTGTCATTCTGGCACTACATTTGCTTAAATTGCTATTTTGGCAACAGCACAGAAAACTAATATT 
TTTAAGCAGAGAATCTTGGCAATGAGTGAGAGATGTTAATTTCACAGAAGCACAACTCCCAACCCAACCC 
TTAGGAAAAGCCCTCITCCATCGTTACAGTGCTCAGTGAATATTAATTTAGTTCTGCTTAAGTGGTTGCT 
ATACAAACTTTGAATAGCCACCTAATAAATAAACCTTGCAT 

TTATTGGAAAGTGATTTTAAGCAATTGCTTCCTCAGTGTCAGGGCACATGTGAATTTCCACACCT^ACAG 
AGCATGAGGAACCAGOTGACATGCTGGGTTGTGACTGGCAGCTTTAGCAGCCTCGGTACTGAAGCCACAC 
CAGTGTCCGGATGGAAGTCTGCATCTGAGGTTGCTCAGTGTCCCGGTCATTCATTTACACATTTTAACTT 
GCATTAAAGAGCTGTTCTTTTCTGTGGCCTAGACTCTTTTCACTGATCTCAAAATAAACTGGTTTTTTTC 
AAAAAAAAAAAAAAAAGAAAAACAAAAAAAAAACACAAAAGC TG CATGT CTAAAATTACATGG AGT T AG T 



Figure 36 part -25 



70/202 



WO 2004/078130 



PCT/US2004/006308 



TTCGGTTTTGTTTTCCATCAGGAATTTGAGTTCTOTCT 

AGTAGAAATACCTTTGGTGATCTTGTTGAGT^ 

CTGCAATTTTGTACATTATATAGTATTTTGAAGAT^^ 

CTTTTTCCCCGAGAGGGGAAAGTTATGTTCTGCAAATAGTGTGTGTCTTATTT^ 
TGCTATTTATTTTTTTATTGCCT^ 

TGCCGAATTCTCATCTGGATGTTACCAT CAAACATCAGTACACTTGT CATTTCACATGTGTTTAATGTGA 

CAGTTTTTCAGTACTGTATGTGTTAATTTCTACT 

TTCTCAGTTGATCCC 

Human DDEF1 mRNA sequence - var3 (public gi: 7689053) (SEQ ID NO: 50) 

GATTGCCATTAAAGCAGAAATTATAAGGTTGCAACAATATTT 
ATGGATTGACATATTGTGTTATGAATGCACATCT^ 

TATTA1TTTCTTAGTTTTGTTCTTGGACAAATTTAAACANNTTAAAAGA 
GTCAACCCTTCACACAGTTTCCCTACTGT^ 

TGCTCTTTGTTAAACTCC^TTATAGATAATTTTTCC^GTCTTAAGCTCT 

ATAACCAAGTTTTTGAACGCTGCTATGAATTGCACTGTGAAAAGCACTCTTCCOT 

TCCTGAGCCAGAATCAAAAAAAAAAA 

Human DDEF1 mRNA sequence - var4 <pubHc gi: 16552319) (SEQ ID NO: 51) 

CAGAACCTTAAGATTGTCCACATCCTTCATGCAAGACTGOT 

ATATAGCTGCTGTTACAGAGTAAGAAACTCATGGAAGGGCCACCTCAGGAGGGGGATATAATGTGTGTTG 

TAAATATCCTGTGGTTTTCTGCCTTCACCAGTATGAGGGTAGCCTCGGACCCX^CGCGCCTTACTGGTTT 

GCCAAAGCCATCCTTGGCATCTAGCACTTACATCT 

GGAGTATAGGAACTGCTGGCTTTGCAAATAGAAGTGGTCT^ 

TCTGTTCCTAACAATGCAGTATTCTCAATTGTGTTACT 

GTTTTCCAGGTGAAACTTTTAGCTCCATGAC^GACCAGCCTGTAGTTATCTGTGTACACAGTTTACAGCT 

ACAAAAACCTACTTTGGTATTTATTACAGAAAAGTGCTCAGT^ 

ATATTCACTGACCCAAAACTCTTTATGGCATTTTACAATGCACACAGCCTCAT 

GATTTATACTGTCTTATGAGTGCCCGCCCCTGATATA^ 

TGACTATTTTGACAAAAGTTTAAAACACATATGAAGTT 

ATTAGCCTCTACATTACGCATGCATTTAGAAGCTTACCTGAAATCTGCCTTTTATAAAGGAATAGTATGG 
ATAAGTGGAATTGTAGATTTTTTAAACTTGATTGC 

TGTTTCTAATCACTGGCTTTCTCAAGAGTATGGATTGACIATATTGTGTTATGAATGCACATCTCTCIAGAT 
GTGTTGAAGCATCCATTGGATCCATTTTTTAT^ 

TAAAAGATTATTCAAGATGAATTTAAAAGTCAACCCTTCACACAGTTTCCCTACTGTATGTAGAATCCAG 
GTGCTGAAACCAAGTGTTTCTTTTCCCATG^^^ 

TAAGCTCTGTCC71CCTTCAAGTCAATTCATAACCAAGTTTTTGAACGCTGCTATGAATTGCACTGTGA^ 
AGCACTCTTCCCTCTCAGTTTTCTTTT^ 

AGTCTTTTACATCAGTCTGAATTTTGGAAAAGAATGCAATAGCT 

CCTGAGTCCGAGGCTCATGTGTCATTCTGGCACTACATTTGCTTAAATTGCTATTTTGGCAACAGCACAG 

AAAACTAATATTTTTAAGCAGAGAATCTTGGCAATGAG 

CCAACCCAACCCTTAGGAAAAGCCCTCTTCCATCG 

TAAGTGGTTGCTATACAAACTTTGAATAGCCACOTAATAAATAAACCTTGCATGACAAACCTGCT 

TTTTATC^GCTGTTATTGGAAAGTGATTTTAAGGA^ 

CACACCAAACAGAGCATGAGGAACCAGTTGACATGCT^ 

CTGAAGCC^CACCAGTGTCCGGATGGAAGTCTGCATCTGAGGTTGCTC^GTGTCCCG 

ACATTTTAACTTGCATTAAAGAGCTGTTCTTTTCTGTGGCCTAGACTCTTTTCACTGATCTCAAAATAAA 

CTGGTTTTTTTC 

Human DDEF1 mRNA sequence - var5 (public gi: is 08 8 8 17) (SEQ ID NO: 52) 

CAGCTACAAAAACCTACTTTGGTATTTATTAC^ 
GCAAAATATTGACTGACCCAAAACTCTTTATGGGAT 

AAGTGGATTTATACTGTCTTATGAGTGCCCGCCCCTGATATATTACCTCATTATGCAAAAATAACATATC 

TTTCATGACTATTTTGACAAAAGTTTAAAACACATATGAAGTTCAAATTTCAGG 

A7^AATATTAGCCTCTACATTACGCATGCATTTAGAAGCTTACCTGAAATCT 

TATGGATAAGTGGAATTGTACATTTTTTAAACTTGATTGCCATTAAAGCAGAAATTATAAGGTTGCAACA 
ATATTTGTTTCTAATCACTGGCTTTCTC^ 

CAGATGTGTTGAAG(^TCCATTGCATCCATTTTTTATTATTTTCTTAGTTTTGTTCTTG 

ACTTTTAAAAGATTATTCAAGATGAATTTAAAAGTCAACCCTTCACACAGTTTCCCTACTGTATGTAGAA 

TCCAGGTGCTGAAACCAAGTGTTTCTTTTCCCATG 

AGTCTTAAGCTCTGTCCACCTTCAAGTCAATTCATAACCAAGTTTTTGAACGCTGCTATGAATTGCACTG 
TGAAAAGCACTCTTCCCTCTCAGTTTTCTTTTC^ 
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ATTTCAGTCTTTTACATCAGTCTGAATC 

TGTTCCCTGAGTCCGAGGCTCATGTGTCATTCTGGCACTACAT^ 

CACAGAAAACTAATATTTTTAAGCAGAGAATC^ 

AACTCCCAACCCAACCCTTAGGAAAAGCC 

CTGCTTAAGTGGTTGCTATACAAACITTGAATA 

AAATATTTTATCAGCTGTTATTGGAAAGTGATT 

ATTTCCACACCAAAC^GAGCATGAGGAACCAGT^ 

CGGTACTGAAGCCACACCAGTGTCCGGLATGG 

TTTACA^TTTTAACITGCATTAAAGAGCTGTTCTTTTCTGTGG 

ATAAACTGGTTTTTTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human DDEFl raKNA sequence - var6 (Predicted by Proteologics) (SEQ ID NO: 53) 

TTTCGACGCGTGGGTTTTATTCCCTTGAAGA 

GCCTCAAGAAGCATGCAGGAGCCATCATAAGAGTCACAAGGCTCTAG 
AAAGTGAAGAAGTCTGTAAAAGCAATATATAATTCT^ 
CACAAGTTCTTGATAAGTTTGGGAGTAATTTTTTAAGTC 
CAAGTTTTCTACTCTTACAAAGGAACTGTCC^ 

GTGATCTTCACCTTGGATT CTTTGTTAAAAGGAGAC CTAAAGGGAGTCAAAGGAGATCTCAAGAAGCCAT 

TTGAGAAAGCCTGGAAAGATTATGAGAGAAAGTTT 

ACAAGATGGGATGATCCGCACAGAGATAA<^GGAG 

CTCTTTCAGCTCCAAATGTGTGAATATCTCATTAAAGTTAATGLAAATCAAGACCAAAAAGG 
TGCTGCAGAATCTTATAAAGTATTACCATGCACAGTGCAATTTCTTTCAAG^^ 
TAAGTTGAAACAGTACATTGAAAAACTGGCTGCTGATTTATATAATATAAAACAGACC 
AAGAAACAGCTAACTGCACTCCGAGACTTAATAAAATCCT 

GAGATTCTCAGAGCCGGCAAGGAGGATACAGCATGCATCAGCTCCAGGGCAATA 

AAAGAAGGGGTACCTGCTAAAGAAAAGTGACGGGATCCGGAAAGTATGGCAGAGGAGGAAGTGTTCAGTC 
AAGAATGGGATTCTAACCATCTCACATGCCACATCT^ 

GCC^GTAAAACCTAATGCCGAAGACAAAAAATCTTTTGACCTGATATCACATAATAGAACATATCACTO 

TCAGGCAGAAGATGAGCAGGATTATGTAGCATGGATATCAGTA 

ACC^TGGCCTTCCGTGGLAGAGCAGAGTGCGGGAGAG 

AGGATGTCC^GCGGCTCCCAGGGAATGACATTTGCTGCGATTO 

AACCAACTTGGGTATTTTGACCTGTATAGAATGTTCTC 

CGCATTCAGTCTTTGGAACTAGACAAATTAGGAACTTCT^ 

ATAGTTTTAATGATATTATGGAAGCAAATTTACCCAGCCC^ 

GACTGTACGAAAAGAATATATCACTGCAAAGTATGTAGATCATAGGT 

TCATCAGCTAAACTAAATGAATTGCTTGAGGCCATCAAATCCAGGGATTTACTTGCACTAATT 
ATGCAGAAGGGGTAGAGCTAATGGAACC^CTGCTGGAAC 

CCITGCCGTCCGAACTGCAGATCAGAC^TCTCTCCATTTGGTTGACTTCCTTGTACAAAACTGTC 
CTGGATAAG CAGACGGCCCTGGGAAACACAGTT CTACACTACTGTAGTATGTACAGTAAAC CTGAGTGTT 
TGAAGCTTTTGCTC^GGAGCAAGCCCACTGTGGATATAGTTAACCAGGCTGGAGAAACTGCCCTAGACAT 
AGCAAAGAGACTAAAAGCTJMTCCAGTGTGAAGATCTGCTTT^ 

CACGTCCACGTAGAATATGAGTGGAATCTTCGACAGGAGGAGATAGATGAGAGCGATGATGATCTGGATG 

ACAAACCAAGCCCTATCAAGAAAGAGCGCTGACCCAGACCT 

CCCCGAGGACAAGCTGGCACTGCCAGGATTCAGC^ 

TTCACCAACCAGATCTTCGTTTCCACT^ 

TGCCTCCTAGGAACGCCGK3GAAAGGTCCAACTGGCCCACCTTCAACACTCCCTCT 

TAGTGGCAGCTCCACCCTATCCAAGAAGAGGCCTCCTCCCCCACCACCCGGACACAAGAGAACCCTATCC 
GACCCTCCCAGCCCACTACCTCATGGGCCCCCAAA^^ 

CATCCTCTTCAAGTAAGACTACAAACAAGTTTGAGGGACTATCCCAGCAGT 

GACTGCCCTTGGCCCAAGAGTTCTTCCTAAACTACCTCAGAAAGTGGCACTAAGGAAAACAGATCATCTC 
TCCCTAGACAAAGCCACCATCCCGCCCGAAATCTTTCAGAAAT 

AGCCACCACCTGGAGACCTGCCCCCAAAGCCCACAGAACTGGCCCCCAAGCCCCAAATTGGAGAT7TTGCC 

GCCTAAGCCAGGAGAACTGCCCCCCAAACCACAGCTGGGGGACCTGCCACCC^ 

TTACCTCCCAAACCACAGATGAAGGACCTGCCCCCCy^^ 

AGACTGGAGATGTCTCACCCAAGGCTCAGCAACCCTCTGAGG 

ATCCCCAAATGTGCAGTCCAGAGACGCCATCCAAAAG^^ 

ACTCTGCCAGAGACGCCCGTACCACTGCCC^GAAAAATCAATACGGGGAAAAATAAAGTGAGGCGAGTGA 

AGACCATTTATGACTGCCAGGCAGACAACGATGACGAGCTCACATTCATCGAGGGAGAA 

C^CAGGGGAAGAGGAC(^GGAGTGGTGGATTGGCC^CATCGAAGGAC^GCCTGAAAGGAAGGGGGTCTTT 

CCAGTGTCCTTTGTTCATATCCTGTCTGACTAGCAAAACGCAGAACCTTAAGATTGTCCACATCCTTCAT 

GCAAGACTGCTGCCTTC^TGTAACCCTGGGCACAGTGTGT^ 

ATGGAAGGGC CACCTCAGGAGGGGGATATAATGTGTGTTGTAAATATCCTGTGGTTTT CTGCCTTC ACCA 
GTATGAGGGTAGCCTCGGACCCGGCGCGCCTTACTGGTTTGCCAAAGCCATCCTTGGCATCTAGCACTTA 
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CATCTCTCTATGCTGTTCTACAAGCAAACAAACAAAAATAG 

GAAGTGGTCTCCAGCAACTOTTGAAAGGCATAGAATTGACTCTGT^ 

GTGTTACTGAAAATGCAACATTAGCAAAGAGGTGGGTTCTGTTTT^^ 

CAGACCAGCCT^TAGTTATCTGTGTACACAGTTTAC^ 

AAAGTGCTCAGTTAAATGTAAGTGTTATTCCTTCAGC^^ 

TTTTAC^TGCACACAGCCTCATGG^GTTTAGA 

TGATATATTACCTGATTATGCAAAAATAACM 

ATGAAGTTCAAATTTCAGGAACCAAGGACTGCCAGAA^ 

AGCTTACCTGAAATCTGCCTTTTATAAAGGAATAGTATGGATAAGTGGAATTC 

ATTGCCATTAAAGCAGAAATTATAAGGTTGCAACAAT^^ 

TGGATTGACATATTGTGTTATGAATGCACATCTCTCAGATGTGTTGAAGC^ 

ATTATTTTCTTAGTTTTGTTCTTCGACAAATTTAAACTTTTAA 

CAACCCTTCACACAGTTTCCCTACTGTATGTAGAATCCAGGTC 

CTCTTTGTTAAACCCCAATTATAGATAATTT^ 

AACCAAGTTTTTGAACGCTGOTATGAATTGCACTGTGAAAAGCACTCTTCCCTCT 
CCCAGCCATGTTTATCAGATCCTTAAGAAG^ 

AGAATGCAATAGTTGTACTCCACAGTCAGTGGAACTGTTCCCTGAGTCCGAGGCT 
CACTACATTTGCTTAAATTGCTATTTTGGCAACAGCAGAGAAAACTAATATTTTTAAGCAG^ 
GCAATGAGTGAGAGATGTTAATTTCACAGAAGCACAACTC^ 
CATCGTTACAGTGCTCAGTGAATATTAATTT^^ 

CACCTAATAAATAAACCTTGCATGACAAACCTGCAAAATATTTTATCAGCTGT 
AAGCAATTGCTTCCTGAGTGTCAGGGGACATGTGAATTT^ 

ACATGCTGGGTTGTGACTGGCAGCTTTAGCAGCCTCGGTACTGAAGCCACACCA 
CTGCATCTGAGGTTGCTCAGTGTCCCGGTCATTCAT^ 

TTTCTGTGGCCTAGACTCTTTTCACTGATCTCAAAATAAACTGGTTTTTTTCAAA 
AAAACAAAAAAAAAACACAAAAGCTGCATGTCTAAAA.TTACATG 

CAGGAATTTGAGTTCTCTCTAACCCAGCTTAOTGTGGGACATAGGAAAACTCAGTAGAAATACCTTTGGT 

GATCTTGTTGAGTTTAAGTCTGATCTTGATCTTAAACTCAGTAAGCCACTATCTGCAATTTTGTACATTA 

TATAGTATTTTGAAGATATGGAACCTTATGAAAAAAAAATAGCAAATTAGTTCTTTTTCCCCCAGAGGGG 

AAAGTTATGTTCTGCAAATAGTGTGTGTCTTATTTTACTGTTC 

TGCCTAGAACTTC^CATGTTGTATAC^^ 

ATGTTACCATCAAACATCAGTACACTTGTCATTTCACATGTGTTTAATGTGAC^ 



Human DDEF 1 protein sequence - var 1 (public gi: 3187372 8) (SEQ ID NO: 233) 

ETKFTKI EKEKREHAKQHGMIRTEI TGAEI AEEMEKERRLFQLQMCEYDIKVNEI KTKKGVDLLQNLI KY 
YHAQOTFFQDGLKTADKLKQYIEKLAADLYNIKQTO 

MHQLQGNKE YGS E KKGYLLKKSDGI RKVWQRRKCSVKNG I LTI SHATSNRQPAKLNLIiTCQVKPNAEDKK 

SFDLISHNRTYHFQAEDEQDYVAWISVIiTNSKEEALTMAFRGEQSAGENSLEDLTKAIIEDVQRL 

CCDCGSSEPTWLSTNLGILTCIECSGIHREMGVHISRIQSLEI^ 

PSPSPKPTPSSDMTVRKEYITAKYVDHRFSRKTCSTSSAKI^ 

LEPGQEIK3ETALHIAVRTADQTSLHL^^ 

DIVNQAGETALDIAKRLKATQCEDLLSQAKSGKFNPHV^^ 

PRPQS FCHSSS I S PQDKLALPGFSTPRDKQRLS YGAFTNQI FVSTSTDSPTSPTTEAPPLPPRNAGKGPT 
GPPSTLPLSTQTSSGSSTLSKKRPPPPPPGHKRTLSDPPSPLPHGPPNKGAVPWGNDGGPSSSSKTTNKF 
EGLSQQSSTSSAKTALGPRVLPKLPQKVALRKTDHLSI^ 
TEIiAPKPQIGDLPPKPGELPPKPQLGDIiPPKPQIjSDLPPKPQMK^ 

PSEVTLKSHPLDLSPNVQSRDAIQKQASEDSNDLTPTLPETPVPLPRKINTGKNKVRRVKTIYDCQADND 
DELTFI EGE VI I VTGEEDQE WWIGHI EGQPERKGVFPVS FVHI LSD 

Human DDEF I protein sequence - var2 (public gi: 63 3 0854) (SEQ ID NO: 234) 

KREHAKQHGMI RTE ITGAE I AEEMEKERRLFQLQMCE YL I KVNE I KTKKGVDIiLQNLI KYYHAQCNFFQD 

GIiKTADKLKQYIEKLAADLYNI KQTQDEEKKQLTALRDLI KSSLQLDQKESRRDSQSRQGGYSMHQLQGN 

KE YGSEKKGYLLKKSDGI RKVWQRRKCSVKNGI LT I SHATSNRQPAKLNLLTCQVKPNAEDKKS FDLI SH 

NRTYHFQAEDEQDYVAWI SVLTNSKEEALTMAFRGEQSAGENSLEDLTKAI I EDVQRLPGNDI CCDCGSS 

EPTWIiSTNLGILTCIECSGIHREMGVHISRIQSLEIiDKLGTSE 

TPSSDMTVRKEYITAKYVDHRFSRKTCSTSSAKLNEL^ 

GETALHLAVRTADQTSIiHLVDFLVQ^ 

ETALDIAKRLKATQCEDLLSQAKSGKFNPHVHra 

HSSSISPQDKLALPGFSTPRDKQRLSYGAFTNQIFVSTSTDSPTSPTTEAPPLPPRNAGKGPTGPPSTLP 
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l^TQTSSGSSTLSKKRPPPPPPCraKRT^^ 

STSSAKTALGPRVLPKLPQKVMJE^KTDHLSIJDKAT 

QIGDLPPKPGELPPKPQIiGDLPPKPQLSDLPPKPQMKDL 

SHPLDLSPNVQSRDAIQKQASEDSNDLTPTLPETPVPLPRKINTGKNKVRRVOT 
GEVI IVTGEEDQEWWIGHIEGQPBRKGVFPVSFVHIIiSD 

Human DDEF1 protein sequence - var3 (public gi: 7689054) (SEQ ID NO: 235) 

MNAHLSDVLKHPIiHPFFIIFIiVLFLDKFKXXK^ 
Ij 

Human DDEF1 protein sequence - var4 (public gi: 18088818) (SEQ ID NO: 236) 

MNAHLSDVLKHPLHPFFI I FLVLFLDKFKLLKDYSR 

Human DDEF1 protein sequence - var5 (Predicted by Proteologics) (SEQ ID NO: 237) 

MIGQPQEACRSHHKSHKALDQDRTALQKVKKSVKAIYNSGQDHVQNE 
TAFVKFSTLTKELSTLLJCNLI^^ 

EHAKQHGMIRTE I TGAEI AEEMEKERRL FQLQMCE YLI KVNE I KTKKGVDLLQNLI KYYHAQGTFFQDGL 

KTADKLKQYIEKIiAADLYNIKQTQDEEKKQLTAIiRDLIKSSIjQLDQKESRRDSQS 

YGSEKKGYIiLKKSDGIRKVWQRRKCSVKNGILTISHATSNRQPAKL^ 

TYHFQAEDEQDYVAWISVLTNSKEEALTMAFRGEQSAGENSLEDLTKAIIEDVQRLPGNDICCDOTSSEP 

TWLSTNLGILTCIECSGIHREMGVHISRIQSLEIjDKIjGTSELLLiAKNVGNNSF 

SSDMTTOKEYITAKYVDHRFSRKTCSTSSAKIjNE 

ALDIAKRLKATQCEDLLSQAKSGKFNPHVHVEYEWNIiRQEElDESDDDLDDKTSPI 

SSISPQDKIALPGFSTPRDKQRIiSYGAFTNQIFVSTSTDSPTSPTTEAPPLPPRNAGKGPTGPPSTL 

TQTSSGSSTLSKKRPPPPPPGHKRTLSDPPSPLPHGPPNKGAVPWGNDGGPSSSSKTTNKFEGLSQQSST 

SSAKTALGPRVLPKLPQKVALRJCroHLSLDKATIPPEIFQKSSQLAELPQKPPP 

GDLPPKPGELPPKPQLGDIiPPKPQIiSDLPPKPQMKD^^ 

PLDIiSPNVQSRDAI QKQASEDSNDIjTPTLPETPVTPLPRKINTGKNKVRRVKTI YDCQADNDDELTFI EGE 
VI I VTGEEDQEWWI GH I EGQPERKGVFPVS FVHI LSD 

Human DDEF1 pray sequence - varl (SEQ ID NO: 54) 

GCGCCGCCATGGTAGTACCCATACGACGTACCAGTATTACGCTCATATGGCCATGGCAGGCCAGTGAATT 
CCACACCAAGCAGTGGTATCAACGCAGAGTGGG^^ 

AGCCCACAGAAACTGGCCCCCAAGCCCCAAATTGGAGATTTGCCGCCTAAGCCAGGAGAACrGCCCCC^ 

AAC(^CAGCTGGGGGACCTGCCACCCAAACCCCAACACTCAGACTTACCTCCCAAACCA(^ 

CCTGCCCCCCAAACCACAGCTGGGAGACCTGCTAGCAAAATCCCAGACTGGAGATGTCTCACCCAAGGCT 

GAGCAACCCTCTGAGGTCACACTGAAGTCAC^ 
CGATCCAAAAGCAAGG&TNTGAAGACTCG 

GCCCANAAAAAT CANTACGGGGAAAANTAAlSnsrrGAGGCGAGTGAAAAC CTTTAATGACTGC CAGGCANAC 
ANNATGACAAGCTCNATTCNTCNAGGGANAAGTGTTATCGTNCAGGGAAAAAGNNCNGGATTGTGGGTCC 

NNCAATTTTCNTCCNNTNNTCNNACTTATTANAATNGCNNGGC^ 
GAAAANAGGNNTTTNl^CAAGGANCNTNNNNTNGTTTO 
NCNCNTTTTTNTNNAAAAACN 
NNTNNNGGGGGGNNNANACCCCCCC 

Uniqene Name : EIF3S3 Unigene ID : Hs.5818 9 Clone ID : 3GD_18 
Human EIF3S3 mKNA sequence - varl (public gi: 2351379) (SEQ ID NO: 55) 

GAAAGATGGCGTCCCGCAAGGAAGGTACCGGCTCTACTGCCACCTCTTCCAGCTCCACCGCCGGCGCAGC 
AGGGAAAGGCAAAGGCAAAGGCGGCTCGGGAGATTCAGCCGTGAAGCAAGTGCAGATAGATGGCCTTGTG 
GTATTAAAGATAATCAAACATTATGAAGAAGAAGGACAAGGAACTG 

GTCTGGTTGTAGAAGATCGGCTTGAAATTACCAACTGCTTTCCTTTCCCTCAGCACACAGAGGATGATGC 

TGACTTTGATGAAGTCC^U^TATCAGATGGAAATGATGCGGAGCCTTCGCCATGTAAACTVTTGATCATCTT 

CACGTGGGCTGGTATCAGTCCACATACTATGGCTCATTCGTTACCCGGGCACTCCTGGACTCTCAGTTTA 

GTTACCAGCATGCCATTGAAGAATCTGTCGTTCTCATTTATGATCCCATAAAAACTGCCCAAGGATCTCT 

CTC^CTAAAGGCATACAGACTGACTCCTAAACTGATGGAAGTTTGTAAAGAAAAGGATTTTTCCCCTGAA 

GCATTGAAAAAAGC^VATATCACCTTTGAGTACATGTTTGAAGAAGTGCCGATTGTAAOT 

ATCTGATCAATGTCCTAATGTGGGAACTTGAAAAGAAGTCAGCTGTTGCAGATAAACATGAATTGCTCAG 
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C CTTG CCAGCAGC?UVTCATTTGGGGAAGAATCTAC AGTTG CTGATGGACAGAGTGGATGAAATGAGCCAA 
GATATAGTTAAATAGAACAGATACATGAGGAATACTAGTAAACAAGAGGAGCAGAA 

AGCGTCGCCAGCAGGAGAATATGCAGCGCCAGAGCCGAGGAGAACCCCCGCTCCCTGAGGAGGACCTGTC 
CAAACTCTTGAAACCACCACAGCCGCCTGCCAGGAT^ 

TAGTGCCAGAAGATCAAGGAGTTCACTGCCCAAAACTTAGGCAAGCTCTTCATC 

AATACAACAACTAAGAAAAGGAAGTTTCCAGAAAAGAAGTTAACATGAACTCTTGAAGTCACACC^ 

AACTCTTGGAAGAAAT AT ATTTG CATATTGAAAAG CACAGAGG ATTTCTTTAGTGTCATTGCCGATTTTG 

GCTATAACAGTGTCTTTCTAGCCATAATAAAATAAAAAAAi^^ 

AAAAAAAAAAAAAAAAAAAA 

Human EEF3S3 mRNA sequence - var2 (public gi: 21751901) (SEQ ID NO: 56) 

AGGCGCGTAGGAAGAGCTTCTCTGAAAGACTGGCAGTATAGTAGTAGTC^GTGATAATATTGAGCCTTAA 
TATGTTCCAGAC^CTCTCCTAAGTGATTT^^ 

AAGAAGGAAATGATATATTGGATAGCTATGAGTGGGGAGGTTTGTACTGGCTGCTTTCCCATAAAGAAAT 

TAAGCACGTTCACGAAGGGCACGTAGTl^rGTTAGTGTCTGGAACCCAGTTTTCGTGCCTGAAGTT 

GTTCTTGCTACACCACCATAGAAACTAACGTCACTCi^ 

TTTGCATTTCCTCCTGCCTTAGGACCATCCTAAAT 

GCTTCGTTTGCCTAGAGTCAGAATTeCTAACCTTG 

CTTGGGCAACTCCCGAAGAAAGCTGGGTTCAACTTCCTCACTGTCAAACTGGTTGTAGGTCTAGATAAGT 
TTGAAGTACTCTTTTTATGTGCATGGTCTCTGACATAGGAAGACTACATACTGGGCCAGTAACAGGAAGG 
CACAAAGCTGACTGGAGGTTTAAAAATTACTTGGT 

TCAAGTTCTCCCGTGGCTAACTGTCGAGTATCCACTTCAAGATCATTCCATCGGAAAGAGGTCGAAAATG 

TACAGTAGGCATGCACAAAGGATACCGCCTGGAAAGAAGATGGCGTCCCGCAAGGAAGGTACCGGCTCTA 

CTGCCACCTCTTCCAGCTCCACCGCCGGCGCAGCAGGGA2\AGGCAAAGGCAAAGGCGGCTCGGGAGATTC 

AGCCGTGAAGCAAGTGCAGATAGATGGCCTTGTGGTATTAAAGATAATCAAACATTATCAAGAAGAAG 

CAAGGAACTGAAGTTGTTCAAGGAGTGCTTTTGGGTCTGGTTGTAGAAGATCGGCTTGAAATTACCAA 

GCTTTCCTTTCCCTCAGCACACTVGAGGATGATGCTGACT^^ 

GCGGAGCCTTCGCCATGTAAACATTGATCATCTTCACGTGGGCTGGTATCAGTCCACATACTATGGCTCA 
TTCGTTACCCGGGCACTCCTGGACTCTCAGTTTAGTTACCAGCATGCCATTGAAGAATCTGTCGTTCTCA 
TTTATGATCCCATAAAAACTGCCCAAGGATCTCTCTCACTAAAGGCATACAGACTGACTCCTAAACTGAT 
GGAAGTTTGTAAAGAAAAGGATTTTTCCCCTGAAGCACTGAAAAAAGCAAATATCA 

TTTGAAGAAGTGCCGATTGTAATTAAAAATTCACATCTGATCAATGTCCTAATGTGGGAACTTGAAAAGA 

AGTCAGCTGTTGCAGATAAACATGAATTGCTCAGCCTTGCC^GCAGCAATCAITTGGGGAAGAATCTA^ 

GTTGCTGATGGACAGAGTGGATGAAATGAGCCAAGATATAGTTAAATACAACACATACATGAGGAATACT 

AGTAAACAACAGCAGCAGAAACATCAGTATCAGCAGCGTCGCCAGCAGGAGAATATGCAGCGCCAGAGCC 

GAGGAGAACCCCCGCTCCCTGAGGAGGACCTGTCCAAACTCTTCAAACCACCACAGCCGCCTGCCAGGAT 

GGACTCGCTGCTCATTGCAGGCCAGATAAACACTTACTGCCAGAACATCAAGGAGTTCACTGCCCAAAAC 

TTAGGCAAGCTCTTCATGGCCCAGGCTCTTCAAGAATACAACAACTAAGAAAAGGAAGTTTCCAGAAAA 

AAGTTAACATGAACTCTTGAAGTCACACCAGGGCAACTCTTGGAAGAAATATATTTGCATATTGAAAAGC 

ACAGAGGATTTCTTTAGTGTCATTGCCGATTTTGGCTATAACAGTGTCTTTCTAGCCATAATAAAATAAA 

ACAAAATCTTG 

Human EIF3S3 mRNA sequence - var3 (public gi: 12653234) (SEQ ID NO: 57) 

GGCACGAGGATGGCGTCCCGCAAGGAAGGTACCGGCTCTACTGCCACCTCTTCCAGCTCCACCGCCGGCG 
CAGCAGGGAAAGGCAAAGGCAAAGGCGGCTCGGGAGATTCAGCCGTGAAGCAAGTGCAGATAGATGGCCT 
TGTGGTATTAAAGATAATCA7VACATTATCAAGAAGAAGGACAAGGAACTGAAGTTGTTCAAGGAGTGCTT 
TTGGGTCTGGTTGTAGAAGATCGGCTTGAAATTACCAACTGCTTTCCTTTCCCTCAGCACACAGAGGATG 
ATGCTGACTTTGATGAAGTCCAATATCAGATGGAAATGATGCGGAGCCTTCGCCATGTAAACATTGATCA 
TCTTCACGTGGGCTGGTATCAGTCCACATACTATGGCTCATTCGTTACCCGGGCACTCCTGGACTCTCAG 
TTTAGTTACCAGCATGCCATTGAAGAATCTGTCGTTCTCATTTATGATCCCATAAAAACTGCCCAAGGAT 
CTCTCTCACTAAAGGCATACAGACTGACTCCTAAACTGATGGAAGTTTGTAAAGAAAAGGATTTTTCCCC 
TGAAGCATTGAAAAAAGCAAATATCACCTTTGAGTACATGTTTGAAGAAGTGCCGATTGTAATTAAT^AAT 
TCACATCTGATCAATGTCCTAATGTGGGAACTTGAAAAGAAGTCAGCTGTTGCAGATAAACATGAATTGC 
TC^GCCTTGCCAGCAGCAATCATTTGGGGAAGAATCTACAGTTGCTGATGGACAGAGTGGATGAAATGAG 
CCAAGATATAGTTAAATACAACACATACATGAGGAATACTAGT^ 

CAGCAGCGTCGCCAGCAGGAGAATATGCAGCGCCAGAGCCGAGGAGAACCCCCGCTCCCTGAGGAGGACC 
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TGTCCAAACTCTTCAAACCACCACAGCCGCCTGCCAGGATG^ 
CAGTTACIGCCAGAACATCAAGGBMOT 

CAAGAATAGAACAACTAAGAAAAGGAAGTTTCCAGAAAAGAAG 
GGGCAACTCTTGGAAGAAATATATTTGCAT^ 
TTTGGCTATAACAGTGTCTTTCTAGCCATAATAAAAT^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human EIF3S3 protein sequence - varl (public gi: 12653235) (SEQ ID NO: 238) 

MASRKEGTGSTATS S S STAGAAGKGKGKGGSGDSAVKQVQIDGLVVIiKI IKHYQEEGQGTEVVQGVLLGL 

VVEDRLEITNCFPFPQHTEDDADFDEVQYQMEM^ 

QHAIEESVVIiIYDPIKTAQGSLSLKAYRLTPKLMEVCK^ 

INVLMVmLEKKSAVADKHELLSLA^ 

RQQENMQRQSRGEPPLPEEDIjSKLFKPPQPPARMDSLLIAGQINTYCQN^ 
NN 



Unigene Name; EPS8L2 Unigene ID; Hs . 55016 

Human EPS8L2 mRNA sequence - varl (public gi: 21264615) (SEQ ID NO: 58) 

XSTCGACGGCCATTACCAATCXSCGAAACCCCGCAACCTGTCGCTCAGG^^ 

CGGCCCGGCCCCGCCGAGCGTCCCACCCGCCCGCGGGAGACCTGGCGCCCCGGCCGAGGCGCGAAC^G 
GGACGCACCGGCGAGCGCCGAGGGGACAGGCCGAGCGCGGGGCGCCGGAGGCAGGTGTGGGACAGGCACT 
GGCCTCAGACCGGGGCCACACTGAGGTCTGCCCTTCTCCCGCTGGCCGCCACC 
GTCCX^GGCCGTGAGCTGCTGCCCGGGTGCCACCTATGGCAGCCTGGGC^ 

ATGAGCCCCAAGGACCTGTTTGAGCAGAGGAAGAAGTATTCCAACTCCAACGTCATCATGCACGAGACCT 

CGCAGTACCACGTCCAGCACCTGGCGACATTCATCATGGACAAGAGCGAAGCCATCA 

CGCC^TCCGGAAGCTGGTGCAGCTGAGCTCC^GGAGAAGATCT 

AACGACCAGTCGTTGCGGCTGCTGGACATCGAGTCACAGGAGGAGCTGGAAGACTTCCCGCTGCCCACGG 
TGCAGCGCAGCCAGACGGTCCTCAACCAGCTGCGCTACCCGTCTGTGCTGCTGCTCGTGTGCC^GGACTC 
GGAGC^GAGCAAGCCGGATGTCCACTTCTTCCACTGCGATGAGGTGGA 

ATCGAGAGCGCGTTGGCCGACTGCCGGCTGGGCAAGAAGATGCGGCCGCAGACCCTGAAGGGACACCAGG 
AGAAGATTCGGCAGCGGCAGTCCATCCTGCCTCCTCCCCAGGGCCCGGCGCCCATCCCCTTCCAGCACCG 
CGGCGGGGATTCCCCGGAGGCCAAGAATCGCGTGGGCCCGCAGGTGCCACTCAGCGAGCCAGGTTTCCGC 
CGTCGGGAGTCGCAGGAGGAGCCGCGGGCCGTGGTGGCTCAGAAGATAGAGAAGGAGACGCAAATCCTCA 
ACTGCGCCCTGGACGACATCGAGTGGTTTGTGGCCCGGCTGCAGAAGGCAGCCGAGGCTTTCAAGCAGCT 
GAACCAGCX3GAAAAAGGGGAAGAAGAAGGGCAAGAAGGCGCCAGCAGAGGGCGTC 

CGGCCCCCCTCTGAGGGCGAGTTGATCGACTGCTTCCAGAAAATCAAGCTGGCGATTAACTTGCTGGCAA 
AGCTGCAGAAGCACATCCAGAACCCCAGCGCCGCGGAGCTCGTGCACTTCCTCTTCGGGCCTCTGGACCT 
GATCGTCAACACCTGCAGTGGCCCAGACATCGCACGCTCCGTCTCCTGCCCACTGCTCTCCCGAGATGCC 
GTGGACTTCCTGCGCGGCCACCTGGTCCCTAAGGAGATGTCGCTGTG 

TGCGGCCCCGTTCCGAGTGGCCGCGGGAGCCACAGGTGCCCCTCTACGTGCCCAAGTTCCACAGCGGCTG 
GGAGCCTCCTGTGGATGTGCTGCAGGAGGCCCCCTGGGAGGTGGAGGGGCTGGCGTCTGCCCCCATCGAG 
GAGGTGAGTCCAGTGAGCCGACAGTCCATAAGAAACTCCCAGAAGCACAGCCCCACTTCAGAGCCCACCC 
CCCCGGGGGATGCCCTACCACCAGTCAGCTCCCGACATACTCACAGGGGCTACCAGCCAACACCAGCCAT 
GGCCAAGTACGTCAAGATCCTGTATGACTTCACAGCCCGAAATGCCAACGAGCTATCGGTG CT CAAGGAT 
GAGGTCCTAGAGGTGCTGGAGGACGGCCGGCAGTGGTGGAAGCTGCGCAGCCGCAGCGGCCAGGCGGGGT 
ACGTGCCCTGCAACATCCTAGGCGAGGCGCGACCGGAGGACGCCGGCGCCCCGTTCGAGCAGGCCGGTCA 
GAAGTACTGGGGCCCCGCCyVGCCCGACCCACAAGCTACCCCCAAGCTTCCCGGGGAACAAAGACGAGCTC 
ATGCAGCACATGGACGAGGTCAACGACGAGCTCATCGGGAAAATCAGCAACATCAGGGCGCAGCCACAGA 
GGCACTTCCGCGTGGAGCGCAGCCAGCCCGTGAGCCAGCCGCTCACCTACGAGTCGGGTCCGGACGAGGT 
CCGCGCCTGGCTGGAAGCCAAGGCCTTCAGCCCGCGGATCGTGGAGAACCTGGGCATCCTGACCGGGCCG 
CAGCTCTTCTCCCTCAACAAGGAGGAGCTGAAGAAAGTGTGCGGCGAGGAGGGCGTCCGCGTGTACAGCC 
AGCTCACCATGCAGAAGGCCTTCCTGGAGAAGCAGCAAAGTGGGTCGGAGCTGGAAGAACTCATGAACAA 
GTTTCATTCCATG^TCAGAGGAGGGGGGAGGACAGCTAGGCCCAGCTGCCTTGGGCTGGGGCCTGCGGA 
GGGGAAGCCCACCCACAATGCATGGAGTATTATTTTTATATGTGTATGTATTTTGTATCAAGGACACGGA 
GGGGGTGTGGTGCTGGCTAGAGGTCCCTGCCCCTGTCTGGAGGCACAACGCCCATCCTTAGGCCAAACAG 
TACCCAAGGCCTCAGCCCACACCAAGACTAATCTGAGCGAA^CCTGCTGCTTGGTGGTGCCAGCCCCrTG 
TCCACCTTCTCTTGAGGCCACAGAACTCCCTGGGGCTGGGGCCTCTTTCTCTGGCCTCCCCTGTGCACCT 
GGGGGGTCCTGGCCCCTGTGATGCTCCCCCATCCCCACCCACTTCTACATCCATCCACACCCCAGGGTGA 
GCTGGAGCTCCAGGCTGGCCAGGCTGAACCTCGCACACACGCAGAGTTCTGCTCCCTGAGGGGGGCCCGG 
GAGGGGCTCCAGCAGGAGGCCGTGGGTGCCATTCGGGGGAAAGTGGGGGAACGACACACACTTCACCTGC 
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AAGGGCCGACAACGCAGGGGAC^CCX3TGCCGGCTTCAGACACTCC<^^ 
ACTGGAGCTTTCTCTGGCCAAGTTTGAGGCCAA 
GGTGCCTGGACTTGAGTCTCACCCTACAGATGAG 
GTTAAGTCTCCCAGGAAAAAAAAAAAAAAAA 

Human EPS8L2 protein sequence - varl (public gi: 21264 sis) (SEQ ID NO: 239) 

MSQSGAVSCCPGATNGSLGRSDOTAKMS^ 

VDDAIRKLVQLSSKEKIWTQEMliljQVNDQSIiRIjIiDIESQEEIiEDFPLPTVQR^ 

QDSEQSKPDVHFFHC3DEVEAEbVHEDIESAIiADCRLGK3CMRPQTLKGHQEKIRQRQSI LPPPQGPAPI PF 
QHRGGDS PEAKNRVGPQVPLSEPGFRRRESQEE PRAVLAQKIEKETQ I LNCALDDI EWFVARLQKAAEAF 
KQLNQRKKGKKKGKKAPAEGVLTIaRARP 

liDLIVNTCSGPDIARSVSCPLLSRDAVDFIiRGHLVPKEMSLWESLGESWm 

SGWEPPVDVLQEAPWEVEGLASAPIEEVS PVSRQS IRNSQKHSPTSEPTPPGDALPPVSSPHTHRGYQPT 

PAMAKYVKILYDFTARNANELSVLKDEVLEVL^ 

AGQKYWGPASPTHKLPPSFPGNKDEIjMQH^EVl^EM 

DEVRAWLEAKAFSPRIVENIjGIIjTGPQIjFSLiNKEEIiKKVCGEEGVRV^ 

MNKFHSMNQRRGEDS 

Human EPS8L2 pray sequence - varl (SEQ ID NO: 59) 

TCNTNCGCCGCCATGGNAGTACCCATACGACGTACCAGN 
AATTCCACCCAAGCAGTGGTATCAACGCAGAGTGGCCATT^ 
AGCACATGGACGAGGTCAACX3ACGAGCTCATCCGGAAA 
CTTCCGCGTGGAGCGCAGCCAGCCCX3TGAGCCAGCCGCT 

GCCTGGCTGGAAGCCAAGGCCTTCAGCCCGCGGATCGTGGAGAACCTGGGCATCCrGACCGGGCCGCAGC 

TCTTCTCCCTCAACAAGGAGGAGCTGAAGAAAGTGTGCGGCGAGGAGGGCGTCCGCGTGTACAGCCAGCT 

CACCATGCAGAAGGCCTTCCTGGAGAAGCAGCAAAGTGGGTCGGAGCTGGAAGAACTC^ 

CATTCCATGAATCAGAGGAGGGGGGAGGAGAGCTAGGCCCAGCTGCCTTGGGCTGGGGCCTGCGGAGGGG 

AAGCCCACCCACAATGCATGGAGTATTATTTTTATAT^^ 

GGTGTGGTGCTTGGCTANAGGTCCCTGCCCCTC^ 

TNACCCAANGGCCTNAACCCCAANCAAGNTTATTTTN^ 

CCNTTGTTGGTGCQJNNCCOJTTGTNCANCOT 

CNTTTTTTNTNN 



Human EPS8L2 pray sequence - , var2 (SEQ ID NO: 60) 

CGAGCGCCGCCTGGNATACCCATACGACGTACCAGNAT^ 
TCCACCCAAGCAGTGGTATCAACGCAGAGTGGCCATTATG 

ACATGGACGAGGTCAACGACGAGCTCATCCGGAAAATCAGCAACATCAGGGCGGAGCCACAGAGG^ 

CCGCGTGGAGCGGAGCCAGCCCGTGAGCCAGCCGCTCACCTACGAGTCGGGTCCGGACGAGGTCCGCGCC 

TGGCTGGAAGCCAAGGCCTTCAGCCCGCGGATCGTGGAGAACCTGGGCATCCTGACCGGGCCGCAGCTCT 

TCTCCCTCAACAAGGAGGAGCTGAAGAAAGTGTGCGGCGAGGAGGGCGTCCGCGTGTACAGCCAGCTCAC 

CATGCAGAAGGCCTTCCTGGAGAAGCAGCAAAGTGGGTCGGAGCTGGAAGAACTCATGAACAAGTTTCAT 

TCCATGAATCAGAGGAGGGGGGAGGACAGCTAGGCCCAGCTGCCTTGGGCTGGGGCCTGCGGAGGGGAAG 

CCCACCCACAATGCATGGAGTATTATTTTTATATGTGTATGTATTTTGTATCAAGGACACGGAGGGGGTG 

TGGTGCTGGCTANAGTTCCCTGCCCCT^ 

NCTNANCCCAGACCAANACTATTTTAACCNAACT 

CCNTTNTCCGTTTTTTTTGNGNCCNAAAATTC^^ 

TNGGGGTTCTGGNC(^TNNmTNTNCCCCTNCCCCCCNTTTTNNNTNTTN 



Human GOCAP1 niRNA sequence - varl (public gi: 10438O60) (SEQ ID NO: 61) 

GATACGTGGCTGCCGTCTGTCCCCGCTGAGGAGGTGCAGCAGCCGGAGATGGCGGCGGTGCTGAACGCAG 
AGCGACTCGAGGTGTCCGTCGACGGCCTCACGCXCAGCCCGGACCCGGAGGAGCGGCCTGGGGCGGAGGG 
CGCCCCGCTGCTGCCGCCACCGCTGCC^CCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGGCGCCrCA 
GGGGAGCAGCCCGAGCCCGGGGAGGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTGGAGCAGC 
GCTGGGGTTTCGGCCTGGAGGAGTTGTACGGCCTGGCACTGCGCTTCTTCAAAGAAAAAGATGGCAAAGC 
ATTTCATCCAACTTATGAAGAAAAATTGAAGCTTGTGGCACTGCATAAGCAAGTTCTTATGGGCCCATAT 
AATCCAGACACTTGTCCTGAGGTTGGATTCTTTGATGTGTTGGGGAATGACAGGAGGAGAGAATGGGCAG 
CCCTGGGAAACATGTCTAAAGAGGATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTTGCCATCT 
CTTTTCAACATATGTTGCGTCCCACAAAATAGAGAAGGAAGAGCAAGACAAAAAAAGGAAGGAGGAAGAG 
GAGCGAAGGCGGCGTGAAGAGGAAGAAAGAGAACGTCTGCAAAAGGAGGAAGAGAAACGTAGGAGAGAAG 
AAGAGGAAAGGCTTCGACGGGAGGAAGAGGAAAGGAGACGGATAGAAGAAGAAAGGCTTCGGTTGGAGCA 
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GCAAAAGCAGCAGATAATGGC^GCTTTAAACTCC 

CAGTATCCAGGGAACTACGAACAGCAGCAAACT 

ACATGGAGCAGTTGTATCAAGTCCAGCTTGC^ 

AGTGGCTGGGTCTTCCTTGCCTACA.TCATCAAAAGTGAATGCAACTGTACCAAGT 
AATGGACAGGCCAAAACACACACTGACAGCTCCGAAAAAGAACTGGA^ 
TGGAGAATGGACCAAAAGAATCTCTTCCAGTAATAG<^ 
AGACTTCAAAGAGAAGATTCAGCAGGATGGAGAT^ 

GTTCGAGTACCCACCCATGAAGAAGGATCATATCTCTTTTGGGAATTTGCCA 
GGTTTGGGGTGTATTTTGAATGGACAGACTCT 

CGATGACGATOAGGAGGAAGAAGAAAACATCGGTTGTGAAGAGAAAGCCAA 
TTGCTGGATGAGATTGTGCCTGTGTACCGACGGGACTGT 

ATCC^GGGAGAGGAGTCTATCTCCTCAAGTTTGACAACTCCTACTCTTTGTGGCGGT 
CTAGAGAGTCTATTATACTAGATAAAAATGTTGTTACAAAGTCT 
CATTTAATTTGGAAATTTCTTTTTACT^ 
CTGATGGTTTGTGAACTCITGCTGGGAAT 

TAAAGGAGATTCCTCAGACTGATCCAGCCCTTGGGTGCTGACCAGCAGAGTCACTAGTGG 

TACATGAGCTACATGTTAAATATTTAAAGTCTCCAAAATAAAACACCCCAACGTTGACCTTACCCGGCTG 

ATGGTTAGCCCCTTGCTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTGTCCTCTAATTTGA 

AATCCATAAGTTAACAAGTCTATATCAGGTGC^GCTGGCTTTGATTAAAGGCC^TTTTTAAAACTTAAAA 

ACTCAACACCTCACAGATTATAAAAAAAAAAAAAAAAAAA 

Human GOCAP1 niKNA sequence - var2 (public gi: i582685i) (SEQ ID NO: 62) 

GGAAGTCGATACGTGGCTGCCTTCTGTCCCCGCTGAGGAGGTGCAGCAGCCGGAGATGGCGGCGGTGCTG 
AACGCAGAGCGACTCGAGGTGTCCGTCGACGGCCTCACX3CTCAGCCCGGACCCGGAGGAGCGGCCTGGGG 
CGGAGGGCGCCCCGCTGCTGCCGCCACCGCTGCCACCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGG 
CGCCTCAGGGGAGCAGCCCGAGCCCGGGGAGGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTG 
GAGCAGCGCTGGGGTTTCGGCCTGGAGGAGTTGTAC^ 

GCAAAGCATTTCATCCAACTTATGAAGAAAAATTGAAGCTTGTGGCACTGCATA 
CCCATATAATCCAGACACTTGTCCTGAGGTTGGATTCTTTGATGTGTTGGGGAATGACAGGAG 
TGGGCAGCCCTGGGAAACATGTCTAAAGAGGATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTT 
GCCATCTCTTTTCAACATATGTTGCGTCCCACAAAA 

GGAAGAGGAGCGAAGGCGGCGTGAAGAGGAAGAAAGAGAACGTCTGCAAAAGGAGGAAGAGAAACGTAGG 
AGAGAAGAAGAGGAAAGGCTTCGACGGGAGGAAGAGGAAAGGAGACGGATAGAAGAAGAAAGGCTTCGGT 
TGGAGCAGCAAAAGCA.G(^GATAATGGCAGCTTTAAACTCCCAG 

AGCCCAACAGTATCCAGGGAACTACGAACAGCAGCAAATTCTCATCCGCCAGTTGCAGGAGG 
CAGCAGTACATGCAGCAGTTGTATCAAGTCCAGCTTGCACAGGAACA 

AAGTAGTAGTGGCTGGGTCTTCCTTGCCTACATCATCAAAAGTGAATGCAACTGTACCAAGTAATATGAT 
GTCAGTTAATGGACAGGCCAAAACACACACTGACAGCTC 

GAAGCCCTGGAGAATGGACCAAAAGAATCTCTTCCAGTAATAGCAGCTCCATCCATGTGGACACGAC^ 
AGATCAAAGACTTCAAAGAGAAGATTCAGCAGGATGCAGACT 

GGTCACTGTTCGAGTACCCACCCATGAAGAAGGATGATATCTCTTTTGGGAATTTGCCACAGACAATTAT 
GACATTGGGTTTGGGGTGTATTTTGAATGGACAGACTCTCGAAACIACTGCTGTCAGCGTGCATGTCAGTG 
AGTCCAGCGATGACGACGAGGAGGAAGAAGAAAACATCGGTTGTGAAG^^ 

CAAGCCTTTGCTGGATGAGATTGTGCCTGTGTACCGACGGGACTGTCATGAGGAGGTGTATGCTGGCAGC 
CATCAATATCCAGGGAGAGGAGTCTATCTCCTCAAGTTTGACAACTCCTACTCTTTGTGGCGGTCAAAAT 
CAGTCTACTACAGAGTCTATTATAOTAGATAAAAATGTTGTTACAAAGTCTGGAGTCTAGGGTTGGGCAG 
AAGATGACATTTAATTTGGAAATTTCTTTTTACTT 

TATTGGTCTGATGGTTTGTGAACTCrrGCTGGGAATCAAAATTTCCTTGAGACTCTTTAGG^ 
TTGGGGTTAAAGGAGATTCCTCAGACTCATCCAGCC^ 

CTGAAGTTACATGAGCTACATGTTAAATATTTAAAGTCTCCAAAATAAAACACCCCAACGTTGACCTTAC 
CCGGCTGATGGTTAGCCCCTTGCTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTGTCCTCT 
AATTTGAAATCCATAAGTTAACAAGTCTATATC^ 

CTTAAAAACTCAACACCTCACAGATTATAATAGAAAAAGAAATGGCCTCAGTTTGATCTCGTTCAGAATG 
ACCCAGATTGTTTCTGCTTTGGGTGCAGCTGTTTAGTTC^ 

ATAATCTTAAACTAGAATGTTCAAAACTAATTGATAATTGAAGTATCAAGATACGTAGAACACCTCAGAG 
ATTTTTCTTCAGGAACTTCCAGAAACTTTGAAT^ 

AAAATACAGGTTTTTTGTTTTGTTTTGTTTTGGCTTCATAGAGTATCTCAAATTGAAACTTT 

AAGAATAAAATTAAGGATTTTATAAACTCAAATTGGCACCTACTGAATTAAAATACATAAAATCAOT 

ATATAATTCAGCATATGGGAAGTAACATTGCACTAATATGGAAATCACTGCCAGAGACAGTCTATTTTCT 

TTTAATTTGTTACTACTTAGTCACAAACCCCACATTATTCCAGTTTGGAATTACTTATTAAGGAGAATTG 

GAAATACATATGCCCATGCTTAAATTTTATAGCTTTAATTTGTGTTATTTCTTTATTGACGGGAAGAGGT 

ACATCTTTTTTTCCTTACTGAAAACCAAATATGG 

GTGATTCTTGTTTTCAGGAAGGGAGAGTGGTATAGATAGAAAATGACAAAGATGGCAATATACACTTAAT 
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GTTGTTATTGTATGTTGTrACTGAA^ 

GAGGGTTTTCATTAACTGCAGTATATACAGTTCACHACATATGGGTTGT^ 

TGGATTTTTTTTTTTTCTTAAGCAGTGCTTAATTTGTGTTTTTTAATTTTGATTC^ 
TCATAGGTGTTCATAACTGTTACATCCAGAACATTTGTCAGGCTCTCTGT^ 
GTATAGAAACCATGGAGTTAGGCACTTCCCTGGAATTTTT^ 
TGTAAAATAAACTTTTAAAAAGCAGGCACTAATATATATTTCTC^ 

TGCACATGTTAAGATGAATTATCTCCTAAAAATATCATTGTTCTTGGGAGCAGTGTATGTTACTTTACAT 
AGCAGCGGTTCCTGTCATGTGTTCATGTTCAGAAATATITTT^^ 

TGTTGATTAGTACAGTACAAGTTGCX1ATTTCAAAAAGATCTTGAAAGTAATATATTTAATCAATTA 
GTTTATCTGTAAAAAAAAAAAAAAAAAA 

Human GOCAP1 mRNA sequence - var3 (public gi: 15799258) (SEQ ID NO- 63) 

GGAAGTCGATACGTGGCTGCCTTCTCTCCCCGCTGAGGAGGTCX^GC^GCCGGAC^TGGCGGCGGTGCTG 
AACGCAGAGCGACTCGAGGTGTCCGTCGACGGCCTCACGCTCAGCCCGGACCCGGAGGAGCGGCCTGGGG 
CGGAGGGCGCCCCGCTGCTGCCGCCACCGCTGCCACCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGG 
CGCCTCAGGGGAGCAGCCCGAGCCCGGGGAGGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTG 
GAGCAGCGCTGGGGTTTCX3GCCTGGAGGAGTTGTACGGCCTGGCACTGCGCTTCTTCAAAGAAAAAGATG 
GCAAAGCATTTCATCCAACTTATGAAGAAAAATTGAAGCTTGTGGCACTGCATAAGGAAGTTCTTATGGG 
CCCATATAATCCAGACACTTGTCCTGAGGTTGGATTCTTTGATGTGTTGGGGAATGA(^GGAGGAGAGAA 
TGGGCAGCCCTGGGAAACATGTCTAAAGAGGATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTT 
GGCATCTCTXTTCAACATATGTTGCGTCCCACAAAATAGAGAAGGAAGAGCAAGAAAAAAAAAGGAAGGA 
GGAAGAGGAGCGAAGGCGGCGTGAAGAGGAAGAAAGAGAACGTCTGCAAAAGGAGGAAGAGAAACGTAGG 
AGAGAAGAAGAGGAAAGGCTTCGACGGGAGGAAGAGGAAAGGAGACGGATAGAAGAAGAAAGGCTTCGGT 
TGGAGCAGCAAAAGCAGCAGATAATGGCAGCTTTAAACTCCGAGACTGCCGTGCAGTTCCAGCAGTATGC 
AGCCCAACAGTATCCAGGGAACTACGAACAGCAGCAAATTCTCATCCXKICAGTTGCAGGAGCAACACTAT 

AAGTAGTAGTGGCTGGGTCTTCCTTGCCTACATCATCAAAAGTGAATGCAACTGTACCAAGTAATATGAT 
GTCAGTTAATGGACAGGCCAAAACACACACTGACAGCTCCGAAAAAGAACTGGAACCAGAAGCTGCAGAA 
GAAGCCCTGGAGAATGGACCAAAAGAATCTCTTCCAGTAATAGCAGCTCCATCCATGTGGACACGACCTC 
AGATCAAAGACTTCAAAGAGAAGATTCAGCAGGATGCAGATTCCGTGATTACAGTGGGCCGAGGAGAAGT 
GGTCACTGTTCGAGTACCCACCCATGAAGAAGGATCATATCTCTTTTGGGAATTTGCCACAGACAATTAT 
GACATTGGGTTTGGGGTGTATTTTGAATGGACAGACTCTCCAAACACTGCTGTCAGCGTGCATGTCAGTG 
AGTCCAGCGATGACGACGAGGAGGAAGAAGAAAACATCGGTTGTGAAGAGAAAGCCAAAAAGAATGCCAA 
CAAGCCTTTGCTGGATGAGATTGTGCCTGTGTACCGACGGGACTGTCATGAGGAGGTGTATGCTGGCAGC 
CATCAATATCCAGGGAGAGGAGTCTATCTCCTCAAGTTTGACAACTCCTACTCTTTGTGGCGGTCAAAAT 

AAGATGACATTTAATTTGGAAATTTCTTTTTACTTTTGTGGAGCATTAGAGTCACAGTTTACCTTATTGA 
TATTGGTCTGATGGTTTGTGAACTCTTGCTGGGAATCAAAATTTCCTTGAGACTCTTTAGCATTCATACT 
^ GG ™^ GGAGATTCCTCAGACTCATCCAGCCC ^ GGTG ^ 

CTGAAGTTACATGAGCTACATGTTAAATATTTAAAGTCTCCAAAATAAAACACCCCAACGTTGACCTTAC 
CCGGCTGATGGTTAGCCCCTTGCTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTGTCCTCT 
AATTTGAAATCCATAAGTTAACAAGTCTATATC^GGTGCATCTGGCTTTGATTAAAGGCCATTTTTAAAA 
CTTAAAAACTCAACACCTCACAGATTATAATAGAAAAAGAAATGGCCTCAGTTTGATCTCGTTCAGAATG 
ACCCA ^ GTTTCTGCTTTGGGTGCAGCTG ™^^ 

ATAATCOTAAAOTAGAATGTTC^AACTAATTGATAATTGAAGTATCAAGATACGTAGAACACCTCAGAG 
ATTTTTCTTCAGGAACTTCCACAAACTTTGAATCCTTGTATCTTTATTTGGTATTCATACTACTAGTAGC 
AAAATACAGGTTTTTTGTTTTGTTTTGTTTTGGCTTCATA^GTATCTCAAA 

™™ CAGCATATGGGAAGTAACATOGCACTAATATGGAAA ^ 

J™™ TTACTACTTAGTCACAAACCCCACATTATTCCAGT ^ 
GAAA J ACA ! ATOCCCAT ^^ 

ACATCTTTTTTTCCTTACTGAAAACCAAATATGGATTAATTGCCTCAAATTTGTATAAA^ 

G I™ C ^ T ^ CAGGAAGGGAGAGTGGTATAGATAGAA ^ 
GTOTTATTGTATGTTGTTACTC^ 

GAGGGTTTTCATTAACTGCAGTATATACAGTTCACTACATATGGGTTGTTTGAGTTTTTTGTGTGCTGTA 
TTTCTTTCTGTTTTTTTAATACCTGGTTTTGTACATATCTAACTCTGTTCTCTTTTGGTTGTTCAGAAAC 



TCATAGGTGTTCATAACTGTTACATCCAGAACATTTGTCAGGCTCTCTGTCAGCTTTTCATGTACATATG 

TGTAAAATAAACTTTTAAAAAGCAGGCACTAATATATATTTCTTCCAGCCTTTGATTACAAATCTGTC^ 
TGCACATGTTAAGATGAATTATCTCCT^^ 

AGCAGCGGTTCCTGTCATGTGTTCATGTTCAGAAATATTTraGGTTTT 
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TGTTGATTAGTACAGTACAAGTTGCGATTTCAAAAAGAT 
GTTTATCTGTAAAAAAAAAAAAAAAAAA 

Human GOCAP1 mRNA sequence - var4 (public gi: 21961496) (SEQ ID NO: 64) 

CGGACX3GGTGGGTGCCATCTCTTTTCAACATATGTT 

AAAAAAGGAAGGAGGAAGAGGAGCX3AAGGCGGCGTGAAGAGGAAGAAAGA 

AGAGAAACGTAGGAGAGAAGAAGAGGAAAGGCTTCGACGGGAGGAAGAGGAAAGG 

GAAAGGCTTCGGTTGGAGCAGCAAAAGCAGCAGATAATGGCAGCTTTAAACTCCCA 

TCC^GCAGTATGCAGCCCAACAGTATCCAGGGAACTACGAACAGCAGCAAATTCT 

GGAGCAACACTATCAGCAGTACATGCAGCAGTTGTATCAAGTCCAGCT 

CAGAAACAACAGGAAGTAGTAGTGGCTGGGTCTTCCT^ 

GAAGTAATATGATGTCAGTTAATGGACAGGCCAAAACAC^ 

AGAAGCTGCAGAAGAAGCCCTGGAGAATGGACCAAAAGAATCTCTTCCAGTA 

TGGACACGACCTCAGATCAAAGACTTCAAAGAGAAGATTCAGCAGGATGC^ 

GCCGAGGAGAAGTGGTCACTGTTCGAGTACCCAC CCATGAAGAAGGATCATATCT CTTTTGGGAATTTGC 

CACAGACAATTATGACATTGGGTTTGGGGTGTATTTTGAATGGACAGACTCTCC 

GTGCATGTCAGTGAGTCCAGCGATGACGATOAGGAGGAAGAAGAAAAC^TCGGTTGTGAAGA 

AAAAGAATGCCAACAAGCCTTTGCTGGATGAGATTGTGCCTGTGTACCGACGGGACT 

GTATGCTGGCAGCCATCAATATCCAGGGAGAGGAGTCTATCTCCTCAAGTTTGAGAACTCCTA 

TGGCGGTCAAAATCAGTCTACTACAGAGTCTATTATACTAGATAAAAATGTTGTTACAAAGTCTGGAGTC 

TAGGGTTGGGCAGAAGATGACATTTAATTTGGAAATTTCTTTTTACTTTTGTGGAGCATTAGAG 

TTTACCTTATTGATATTGGTCTGATGGTTTGTGAACTCTTGCTGGGAATCAAAATTTCCTTGApACTCTT 

TAGCATTCATACTTTGGGGTTAAAGGAGATTCCTCAGACTCATCCAGCCCT 

TCACTAGTGK3ATGCTGAAGTTAGATGAGCTAC^ 

ACGTTGACCTTACCCGGCTGATGGTTAGCCCCT^ 

TACAGTGTCCTCTAATTTGAAATCCATAAGTTAACAAGTCTATATCAGGTGCAGCTGGCTTTGATTAAAG 

GCCATTTTTAAAACTTAAAAACTCAAC^CCTC^ 

CTCGTTCAGAATGACCCAGATTGTTTCTGCTTTGGGTGC^ 

ATTATTTTCTGAGATAATCTTAAACTAGAATGTTCAAAACTAATTGATAATTGAAGTATC 
TACTACTAGTAGCAAAATACAGGTTTTTTGTTTT 

TTGAAACTTTTCTGCACAAAGAATAAAATTAAGGATTTTATAAACTGAAATTGG 

ATACATAAAATCATTTAAATATAATTCAGCAm^ 

AGAGAmGTCTATTTTCTTTTAATTTGTTACT^ 

ACTTATTAAGGAGAATTGGAAATACATATGCCGATGCTTAA^ 

TTATTGACGGGAAGAGGTACATCTTTTTTTCCTTACTGi^^ 

TATAAGTGATTGGCTAGTGATTCTTGTTTTCAGAAGGGAGAGTGGTATAGATAGAAAATGACAAAGATGG 
CAATATAC^CTTAATGTTGTTATTGTATGTTGTTACTG 

AATCACTTCTTGTAGGAGGGTTTTCATTAACTGCAGTATATACAGTTCACTACATATGGGTTGTTTGAGT 
TTTTTGTGTGCTGTATTTCTTTCTGTTTTCT 

GGTTGTTGAGAAACTGGATTTTTTTTTTTCTTAAGCAGTGCTTAATTTGTGTTTTTTAATTTTGATTCAG 
AAGTAGTCCCAGCTCATAGGTGTTCATACTGTTACATCC^GAACATTTGTCAGGCTCTCTGTGAGCTTT^ 
ATGTACATATGGTATAGAAACCATGGAGTTAGGCACTTCCTGGATTTTTTTTTTATGAGAAAAATACTGT 
ATTTAAAATGTAAAATAAACTTTTAAAAAGCAGGCACT^ 

TTTGTCCTTGC^CATGTTAAGATGAATTATCTCCTAAAAATATCATTGTTCTTGGGAGCAGTGTATGTTA 
CTTTACATAGCAGCGGTTCCTGTCATGTGTTCATGTCAGAGTATTTTTGGTTTTAAACTTTCTTATTGCC 
TTTGGCTGTTGATTAGTACAGTAGAAGTGCGATTTCAAAAAGATCTTGAAAGTAATATATTTAATCAAOT 
AAAATGTTTAT CTGTAAAAAAAAAAAAAAAAAAAAAAAA 

Human. GOCAP1 mRNA sequence - var5 (public gi: 24496472) (SEQ ID NO: 65) 

CCGCTGAGGAGGTGCAGCAGCCGGAGATGGCGGCGGTGCTGAACGCAGAGCGACTCGAGGTGTCCGTCGA 

CGGCCTCACGCTCAGCCCGGACCCGGAGGAGCGGCCTGGGGCGGAGGGCGCCCCGCTGCTGCCGCCACCG 

CTGCCACCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGGCGCCTCAGGGGAGCAGCCCGAGCCCGGGG 

AGGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTGGAGCAGCGCTGGGGTTTCGGCCTGGAGGA 

GTTGTACGGCCTGGCACTGCGCCTCTTCAAAGAAAAAGATC 

AAATTGAAGCTTGTGGCACTGCATAAGCAAGTTCTTA^ 

TTGGATTCTTTGATGTGTTGGGGAATGACAGGAGGAGAGAATGGGCAGCCCTGGGAAACATGTCTAAAGA 

GGATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTTGCCATCTCTTTTCAACATATGTTGCGTCC 

CACAAAATAGAGAAGGAAGAGCAAGACAAAAAAAGGAAGGAGGAAGAGGAGCGAAGGCGGCGTGAAGAGG 

AAGAAAGAGAGCGTCTGCAAAAGGAGGAAGAGAAACGTAGGAGAGAAGAAGAGGAAAGGCTTCGACGGGA 

GGAAGAGGAAAGGATACGGATAGAAGAAGAAAGGCTTCGGTTGGAGCAGCAAAAGCAGGAGATAA^ 

GCTTTAAACTCCCAGACTGCCGTGC^GTTCCAGCAGTATGCAGCCCAACGGTATCCAGGGAACTACGA^ 
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AGCAGCAAATTCTCATCCGCC^GTTGCAGGAGCAACACTATCAG 

CCAGCTTGCACAGCAACAGGCAGCATTAG^^ 

ACATCATCAAAAGTGAATGCAACTGTACCAAGTAATATGATGC 

CTCACAGCTXXXiAAAAAGAACTGGAACCAGAAGCTG 

TCTTCCAGTAATAGCAGCTCCATCCATGTGGACA<^^ 

GCAGGATGCAGATTCCGTGATTACAGTGGCCX3AGGAGAAGTGGTCACT 

AAGGATCATATCTCTTTTGGGAATCTGCCACAGACAATTC 

GACAGACTCTCCAAACACTGCTGTCAGCGTGCAT^ 

GAAAACATCX5GTTGTGAAGAGAAA.GCCAAAAAGAATGCCAAGAAGCCT 

TGTACCGACGGGACTGTGATGAGGAGGTGTATGCTGGC^^ 

CCTCAAGTTTGACAACTCCTACTCTTTGTGGCGGTCAAAATCAG 

TAAAAATGTTGTTACAAAGTCTGGAGTCTAGGGTTGGGCAGAAGATG 

TTACTTTTGTGGAGCATTAGAGTCACAGTTTACCTTATTGATATTGGTCTGATGGTTTGTGAACTCT 

TGGGAATCAAAATTTCCTTGAGACTCTTTAGCATTCATACTTTGGGGTTAAAGGAGATTCCT 

TCCAGCCCTTGGGTGCTGACCAGCAGAGTCACTAGGGGATGCTGAAGTTACATGAGCT 

TTTAAAGTCTCCAAAATAAAA^CCCCAACGTTGACCTTACCCG^ 

CTCCATGTGTCTTATGAGAGCCCGTAGTTACAGTGTCCTCT^ 

TATCAGGTGCAGCTGGCTTTGATTAAAGGCCATT^ 

ATAGAAAAAGAAATGGCCTCAGCTTGATCTCGTTCAGAATGACCG^^ 

TGTTTAGTTCAGAGTTATATTACAGAGAATTATTTTCTGAGATAATCTTAAACT 

ATTGATAATTGAAGTATCAAGATACGTAGAACACCTCAGAGATTTTTCTTCAGGAACTO 

GAATCCTTGTATCTTTATTTGGTATTCATACTACTAGTAGCAAAA.TACAGGTTTTTTGTTOT 

TTGTTTTGGCTTGATAGAGTATCTCAAATTGAAACTTTTCTGCACAA 

aactcgaattggc^cctactgaattaaaatacataaaatcatttaaatataattcagcat^ 
acattgcactaatatggaaatc^cixk:cagaga<^gtctattttctt^ 

aac c c cacattatt c c agt ttggaatt ac tt att aagg ag aattggaaat acatgtgc c catgcttaaat 

TTTATAGCTTTAATTTGTGTTATTTCTTTATTGACGGGAAGAGGTACATCTTTTTTTCCTTACTGAAAAC 

AAATATGGATTAATTGCCTCAAATTTGTATAAAGTGATTGGCTAGTGATTCTTGTTTTCAGAGGGGAGAG 

TGGTATAGATAGAAAATGACGAAGATGGCAATATACACTTAATGTTGTTATTGTATGTTGTTACTGAAGA 

CTTAGATTTTTAAAATTTCAAATCCTAAATCACTTCTTGTAGGGGGGTTTTCATTAACTG^ 

AGTTCACTACATATGGGTTGTTTGAGTTTTTTGTGTGCTGTATTTCTTTCTGTTTTTTAATACCTGGTTT 

TGTACATATCTAACTCTGTTCTCTTTTGGTTGTTCAGAAACTGGATTTTTTTCTTCTTAAGCAGTGCTTA 

ATTTGTGTTTTTTAATTTTGATT(^GAAGTAGTCCCAGCTCATAGGCGTTCATACTGTTAC^ 

ATTTGTCAGGCTCTCTGTCAGCTTTCATGTACATATGGTATA 

TTTTTTTTATGAGAAAAATNCTGTATTTAAAATGTAAAATAAACTTT^^ 

TTTCTTCCAGCCTTTGATTACAAATTTGTCCTTGCACATGTTAAGATGAATTATCTCCTAAAAATAT 

TGTTCTTGGGAGCAGTGTATGTTACTTTACATAGCAGCGGTTCCTGTCATGTGTTCATGTCACGAATATT 

TTTGGTTTTAAACTTTCTTATTGCCTTTGGCTGTTGATTAGTACAGTAC^ 

TTGAAAGTAATATATTTAATCAATTAAAATGTTTATCTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AA 

Human GOCAP1 mRNA sequence - var6 (public gi: 28374435) (SEQ ID NO: 66) 

TCCGTCCCCGCTGAGGAGGTG<^GCAGCGGGAGATGGCGGCGGTGCTGAACGCAGAGCGACTCGAGGTGT 
CCGTCGACGGCCTC^CGCTCAGCCCGGACCCGGAG^ 

GCC^CCGCTGC(^CCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGGCGCCTCyVGGGGAGCAGCCCGAG 

CCCGGGGAGGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTGGAGCAGCGCTGGGGTTTCGGCC 

TGGAGGAGTTGTACGGCCTGGCACTGCGCTTCTTCAAAGAAAAAGATGGCAAAGCATTTCATCCAACTTA 

TGAAGAAAAATTGAAGCTTGTGGCACTGCATAAGCAAGTTCTTATGGGCCCATATAATCGAGACACT 

CCTGAGGTTGGATTCTTTGATGTGTTGGGGAATGACAGGAGGAGAGAATGGGCAGCCCTGGGAAACATGT 

CTAAAGAGGATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTTGCCATCTCTTTTCAACATATGT 

TGCGTCCCACAAAATAGAGAAGGAAGAGCAAGAAAAAAAAAGGAAGGAGGAAGAGGAGCGAAGGCGGCGT 

GAAGAGGAAGAAAGAGAACGTCTGCAAAAGGAGGAAGAGAAACGTAGGAGAGAAGAAGAGGAAAGGCTTC 

GACGGGAGGAAGAGGAAAGGAGACGGATAGAAGAAGAAAGGCTTCGGTTGGAGCAGCi^AAAGCAGCAGAT 

AATGGC^GCTTTAAACTCCCAGACTGCCGTGCAGTTCC^GCAGTATGCAGCCCAACAGTAT 

TACGAAC^GCAGCAAATTCTCATCCGCCAGTTGCAGGAGCAACACTATCAGCAGTACATGCAGCAGTTGT 

ATCAAGTCC^GCTTGCACAGCAACAGGCAGCATTACAGAAACAAC 

CTTGCCTACATCATCAAAAGTGAATGCAACTGTAC CAAGTAATATGATGTCAGTTAATGGACAGG C CAAA 
AC^CACACTGACAGCTCCGAAAAAGAACTGGAACCGGAAGCTGCAGAAGAAGCCCTGGAGAATGGACCAA 
AAGAATCTCTTCCAGTAATAGCAGCTCCATCCATGTGGACACGACCT^ 

GATTCAGCAGGATGCAGATTCCGTGATTACAGTGGGCCGAGGAGAAGTGGTCACTGTTCGAGTACCCACC 
CATGAAGAAGGATCATATCTCTTTTGGGAATTTGCCAGAGACAATTATC 

TTGAATGGAC^GACTCTCCAAACACTGCTGTCAGCGTGCATGTCAGTGAGTCCAGCGATGACGACGAGGA 
GGAAGAAGAAAACATCGGTTGTGAAGAGAAAGCCAAAAAGAATGCGAACAAG 
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GTGCCTGTGTACCGACGGGACKSTCATGAGGAGGTGTATGCTGGCAGC 

TCTATOTCCTCAAGTTTGACAACTCCTACT 

TACTAGATAAAAATGlTGTTACAAAGTCTGGAGTCrAG 

TTTCTTTTTACTTTTGTGGAGCATTAGAGTCACAGTCT 

CTCITGCTGGGAATCAAAATTTCCTTGAGACT 

AGACTCATCCAGCCCITGGGTGCTGACCAGCAGAGTCACTAGTGGATGCTG 

TTAAATATTTAAAGTCTCOUU^TAAAAC^^ 

CTGCCTGCTCCATGTGTCTTATGAGAGCCCGTAGTT^ 

AAGTCTATATX^GGTGCAGCTGGCTTTGATTAAAGGCCATTTTTAAAACT^ 

GATTATAATAGAAAAAGAAATGGCCTCAGTT^^ 

GTGCAGCTGTTTAGTTCAGAGTTATATTACAGAG^ 

AAAACTAATTGATAATTGAAGTATCAAGATACGTAGAACACCTCAGAGATTTTTOT 

AAACTTTGAATCCTTGTATCTTTATTTGGTATTCATACTACT^ 

TTTTGTTTTGGCTTCATAGAGTATCTCAAATTGAAACT 

TAAACTCAAATTGGCACCTACTGAATTAAAATACATAAAATCATTTAAATATAATTC^ 
TAACATTGCACTAATATGGAAATCACTGCCAGAGACAGTCTATTTTCTTTTAATTTGTTACTACTTAGTC 

ACAAACCCCACATTATTCC^GTTTGGAATT^^ 
AATTTTATAGCTTTAATTTGTGTTAT 

AACAAATATGGATTAATTGCCTCAAATTTGTATAAGTGAT 

AGTGGTATAGATAGAAAATGACAAAGATGGCAATATACACTT^ 

GTACTTAGATTTTTAAAATTTCAAATCCTAAA^ 

TACAGTTCACTAGATATGGGTTGTTTGAGTTTTT 

TTTTGTACATATCTAACTCTGTTCTCTTTT 

TAATTTGTGTTTTTTAATTTTGATTCAGAAGTAGTCC 

ACATTTGTCAGGCTCTCTGTCAGCTTTCATGTACATATGGTATAGAAACCATGGAGTTAGGCACTTCCTG 

GATTTTTTTTTTTATGAGAAAAATACTGTATTTAAAATGTAAAATAAACTTTTAAAAAGCAGGCACTAAT 

ATATATTTCTTCCAGCCTTTGATTACAAATTTGTCCTTGCACATGTTAAGATGAATTATCTCCTAAAAAT 

ATGATTGTTCTTGGGAGCAGTGTATGTTACTTTAGATAGCAGCGGTTCCTGTCATGTGTTCATG 

TATTTTTGGTTTTAAACTTTCTTATTGCCTTTGGCTGTTGATTAGTACAGTACAAGTGCGATTTCA^ 

GATCTTGAAAGTAATATATTTAATCAATTAAAATGTTTAT CTGTCAAAAAAAAAAAAAAA 

Human GOCAPl mRNA sequence - var7 (public gi: 25053702) (SEQ ID NO: 67) 

CGCTGAGGAGGTGCAGCAGCCGGAGATGGCGGCGGTGCTGAACGCAGAGCGACTCGAGGTGTCCGTCGAC 
GGCCTCACGCTCAGCCCGGACCCGGAGGAGCGGCCTGGGGCGGAGGGCGCCCCGCTGCTGCCGCCACCGC 
TGCCACCGCCCTCGCCACCTGGATCCGGTCGCGGCCCGGGCGCCTCAGGGGAGGAGCCCGAGCCCGGGGA 
GGCGGCGGCTGGGGGCGCGGCGGAGGAGGCGCGGCGGCTGGAGCAGCGCTGGGGTTTCGGCCTGGAGGAG 
TTGTACGGCCTGGCACTGCGCTTCTTCAAAGAAAAAGAT 

AATTGAAGCTTGTGGCACTGCATAAGGAAGTTCTTATGGGCCCATATAATCCAGACACT^ 

TGGATTCTTTGATGTGTTGGGGAATGACAGGAGGAGAGAATGGGCAGCCCTGGGAAA(^TGTCTAAAGAG 

GATGCCATGGTGGAGTTTGTCAAGCTCTTAAATAGGTGTTGCCATCTCTTTTCAACATATGTTGCGTCCC 

AO^AATAGAGAAGGAAGAGCAAGAAAAAAAAAAGGAAGGAGGAAGAGGAGCGAAGGCGGCGTGAAGAGG 

AAGAAAGAGAACGTCTGCAAAAGGAGGAAGAGAAACGTAGGAGAGAAGAAGAGGAAAGGCTTCGACGGGA 

GGAAGAGGAAAGGAGACGGATAGAAGAAGAAAGGCTTCGGTTGGAGCAGCAAAAGCAGCAGATAATGGCA 

GCTTTAAACTCCGAGACTGCCGTGCAGTTCCAGGAGTATGCAGCCCAACAGTATCCAGGGAACTACGAAC 

AGGAGCAAATTCTGATCCGCGAGT^ 

CCAGCTTGCACAGCAACAGGCAGCATTACAGAAAC^ 

ACATCATC^AAAGTGAATGCAACTGTACCAAGTAATATGAT 

CTGACAGCTCCGAAAAAGAACTGGAAC CAG AAG CTG CAGAAGAAG CC CTGGAGAATGGACCAAAAGAATC 
TCTTCCAGTAATAGCAGCTCCATCC^TGTGGAGACGACCT 

CAGGATGCAGATTCCGTGATTACAGTGGGCCGAGGAGAAGTGGTCACTGTTCGAGTACCCACCCATGAAG 
AAGGATCATATCTCTTTTGGGAATTTGCGA.CAGACAATTATGACATTGGGTTTGGGGTGTATTTTGAATG 
GACAGACTCTCCAAACACTGCTGTCAGCGTGCATGTCAGT 

GAAAAC^TCGGTTGTGAAGAGAAAGCCAAAAAGAATGCC^CAAGCCTTTGCTGGATGAGATTGTGCCTG 
TGTACCGACGGGACTGTCATGAGGAGGTGTATGCTGGCAGCCATCAATATCCAGGGAGAGGAGTCTATCT 
CCTCAAGTTTGACAACTCCTACTCTTTGTGGCGGTCAAAATCAGTCTACTACAG^ 

TAAAAATGTTGTTACAAAGTCTGGAGTCTAGGGTTGGGCAGAAGATGACATTTAATTTGGAAATTTCTTT 

TTACTTTTGTGGAGCATTAGAGTCACAGTTTACCTTATTGATATTGGTCTGATGGTTTGTGAACTCTTGC 

TGGGAATCAAAATTTCCTTGAGACTCTTTAGCATTCATACTTTGGGGTTAAAGGAGATTCCTCAGACTCA 

TCCAGCCCTTGGGTGCTGACCAGCAGAGTCACTAGTGGATGCTGAAGTTACATGAGCTACATGTTAAATA 

TTTAAAGTCTCCAAAATAAAACACCCCAACGTTGACCTT^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Human GOCAP1 mRNA sequence - var8 (public gi: 273392s) (SEQ ID NO: 68) : 

GAATTCCGTTGCTGTCGGAGCCCGTAGTTAC^ 

TATCAGGTACAGCTGGCTTTCATTAAAGGCCATTTTTAAA : 

AAT AGAAAAAGAAATGGCCT CAGTTTG ATCTCGTTCAGAATGACC CAG ATTGTTTT CTG CTTTGGGTGCA 

GCTGTTTAGTTCAGAGTTATATTACAGAGAATTATTTTCT 

TAATTCGATAATTGAAGTATCAAGATACGTAGAA(^CCTCAGAGATT 

1TTAGAATCCTTGTATCTTTATTTGGTATTCATAOT 

TGTTTTGTTTTGGCTTCATAGAGTATCTCAAATT^ 

TATAAAACTCAAATTGGCACCTACTGAATTAAAATACATAAAATGC^ 

AGTAACATTGCACTAATATGGAAATCACTGCCAGAGACAGT 

TCACAACCCCACATTATTC CAGTTTGGAATTACTTATTAAGGAGAATTGGAAA CTT 
AAATTTTATAGCTTTAATTTGTGTTATITCT 

AAACAAATATGGATTAATTGCCTCAAATTTGTATAAGTGATTGGCTAGTGATTCTTGT^ 
AGTGGTATAGATAGAAAATGAGAAAGATGGCAATATACAC 
GTACTTAGATTTTTAAAATTTCAAATCCTAAATCAC^ 
ACAGTTCACTACATATGGGTTGTTTGAGTTTTTTGTGTGCTGTA 

TTTGTACATATCTAACTCTGTTCTCTTXTGGTTGTTCAGAAACTGGATTTTTTTTTTC 

TAATTTGTGTTTTTTAATTTTGATTCAGAAGTAGTCCCAGCT 

ACATTTGTCAGGCTCTCTGTCAGCTTTC^ 

GATTTTTTTTTTATGAGAAAAATACTGTATTTAAAATGTAAAATAAACTTTTAAAAAGC 

Human GOCAP1 Protein sequence - varl (pubHc gi: 24496473) (SEQ ID NO: 240) 

MAAVIiNAERLEVSVDGLTLSPDPEERPGAEGAPLLPPPIjPPPSPPGSGRGPGASGEQPEPGEAAAGGAAE 
EARRLEQRWGFGLEELYGIiALRLFKEKDGKAFHPTYEEKIjKLVALHKQV 

drrrewaalgnmskedamvefvkllnrcchl^ 
eekrrreeeerlrreeeeririeeeriirleqqkqqimaalnsqta^ 

QEQHYQQYMQQLYQVQLAQQQAAIjQKQQEVWAGS SLPTS S KVNATVPSNMMPVNGQAKTHTDS SEKELE 
PEAAEEALENGPKESLPVIAAPS^TRPQIKDFQREDSAGCRFRDYSGRGEWTVRVPTHEEGSYLFWEF 
ATDNODIGFGVYFEWTDSPNTAVSVHVSESSDDDEEEEENIG 
VYAGSHQYPGRGVYIiIiKFDNSYSLWRSKSVYYRVYYTR 

Human GOCAP1 Protein sequence - var2 (public gi: 21961497) (SEQ ID NO: 241) 

RTRGCHLFSTWASHKIEKEEQEKKRKEEEERRRREEEERERLQKEEEKRRREEEERLRREEEERRRIEE 
ERLRLEQQKQQIMAAIdtfSQTAVQFQQYAAQQYPGN^ 

QKQQEVWAGSSLPTSSKVNAWPSNMMSVNGQAKTHTDSSEKELEPEA^ 
WTRPQIKDFKEKIQQDADSVITVGRGEWTVRVPTHEEGSYLFWEFATDNYDIGFGVYFEWTDS 
VHVSESSDDDEEEEENIGCEEKAKKNANKPLLDEIVPVYRRDCHEEVYAGSHQYPGR 
.WRS KS VYYRVY YTR 

Human GOCAP1 Protein sequence - var3 (public gi: 15799259) (SEQ ID NO: 242) 

MAAVLNAERLEVSVDGLTLSPDPEERPGAEGAPLLPPPLPPPSPPGSGRGPGASGEQPEPGEAAAGGAAE 

EARRLEQRWGFGLEELYGLAIiRFFKEKDGKAFHPTYEEKL 

DRRREWAALGNl^KEDAMVEFVKLmRCCHLFSTYVASHKIEK^ 

EEKRRREEEERLRREEEERRRIEEERLRLEQQKQQIMAALNSOTAV^ 

QEQHYQQYMMIjYQVQLAQQQAALQKQQEVWAGSSLPTSSKVNATVPSNMMSW 

PEAAEET^ENGPKESIjPVIAAPSMWTRPQIKDFKEKIQQDADSVITVGRGEVVTVR 

atdnydigfgvyfewtdspntavsvhvsessdddeeeeeni 
vyagshqypgrgvyllkfdnsyslwrsksvyyrvyytr 

Human GOCAP1 Protein sequence - var4 (public gi: 10438O61) (SEQ ID NO: 243) 

MAAVLNAERLEVSVDGLTLSPDPEERPGAEGAPLLPPPLPPPSPPGSGRGPGASGEQPEPGEAAAGGAAE 
EARRLEQRWGFGLEELYGI^RFFKEKDGKAFHPTYEEKLKLVALHKQVIMGPYNP 

DRRREWAALGNMS KEDAMVEFVKLLNRCCHLFSTYVASHKI EKEEQDKKRKEEEERRRREEEERERLQKE 

EEKRRREEEERLRREEEERRRIEEERLRLEQQKQQIMAALNSQTAVQFQQYAAQQYPGN^QQQILIRQL 

QEQHYQQ YMQQI1YQVQI1AQQQAALQKQQEVWAGSSLPTS S KVNATVPSNMMS VNGQAKTHTDS S EKELE 

PEAAEEALENGPKESLPVIAAPSMWTRPQIKDFKEKIQQDADSVITVGRGEVVTVRVPTHEEGSYIiB^ 

ATDNYDIGFGWFEWTDSPNTAVSVHVSESSDDDEEEEENIGCEEKAKKNANKPIiIiDEIVPW 

VYAGSHQYPGRGVYLLKFDNSYSLWRSKSVYYRVYYTR 
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Unigene Name: GOSR2 Unigene ID; Hs. 432552 

Human GGSR2 mRNA sequence - varl (public gi: 2316087) (SEQ ID NO: 69) 

ATGGATCCCCTGTTCCAGCAAACGC^^ 

CGGCAGACAAGCAGTCTGTGCACATAGTAGAAAACGA 

TCTAGAACGTCTGGAGATTTTGTCCAGC^^GGAGCCC^ 

GACCAGTTAAAGTATGATGTCCAGCACCTGCAGACTGCGCT 

GGGAGCAGCAGGAGAGACAGCGAGAAGAGCTTCTGTGTCX3AACGTTCACCACT 

CATACCAATGGACGAATCACTGC^GTTTAACTC 

ATTTTAGATGGGCACAATATTTTAGATGGACTGAGGACCC^ 

AGATCCCTGACATTGCCAACATGCTGGGCTTGTCCAACACAGTGA^ 

CCAGGACAAGTACTTTATGATAGGTGGGATGCTC 

Human GOSR2 mRNA sequence - var2 (pubHc gi: 3483524) (SEQ ID NO: 70) 

TTTTTTTTTTCAGGACAGATTGGCCTTTATACTAA 

TCCGGGGCTGGTC&CAGAAGGAAAAAGGTTGTAGTCCCT^ 

ACAGATGTCTCCACGGTGGGATCTGCCCAC7VCTGGCTGGGCAAAAT 

TATATTTCAGGGAAGAGAAGGGAACCAAAGAATTAGAGATACTAAAACTAGAGCT 

AAATCAC^ATCTTTGCCTACAC^ 

AGCCACAGTCTCTGAACCTTAAAAGCCAACCACTCTATT^ 

AGACTGAACACTCCCGGGAAATAACACTGGCCTCAC 

AAATCTTACTTGTTATCGGCAATAGCACTACATCTTC 

AGAAGTATTGGCAGTCAGTCATCTACATGTCACTGAGGCACAGGGGAGGGTGG 

TGAATCGACCTACTATTTAATATAATGGCTGTGAGAAAAGGCCTCTTTCCTTTCCTTTCCACTTTTGCTC 
CACCCTATCAGGAG 

Human GOSR2 mRNA sequence - var3 (public gi: 21961348) (SEQ ID NO: 71) 

GGCCTGCCGGGCCGGCGACATGGATCCCCTGTTCCAGCAAACGCACAAGCAGGTCC^ 
TGCATGGGACGCCTGGAGACGGCAGACAAGC^GTCTGTGCACATAGTAGAAAACGAAATCC^ 
TAGACCAGATATTCAGCCGTCTAGAACGTCTGGAGATTTTGTCC^ 
AAATGCCAGACTTCGGGTTGAC<^GTTAAAGTATC^ 

CAGCATCGGCGCCATGCAAGGGAGCAGCAGGAGAGACAGCGAGAAGAGCTTCTGTCTCGAACCTTCACCA 

OTAACGACTCTGACACCACCATACCAATGGACGAATCACTGCAGTTTAACTCCTCCCTCCAGAAAGTTGA 

CAACGGCATGGATGACCTCATTTTAGATGGGCACAATATTTTAGATGGACTGAGGACCCAGAGACTGACC 

TTGAAGGTGGGGTCCCTGCTGGGGGACAGAGAGAAGGCCTCTTGT^ 

ACTGTGTTTATATTTTGATTACGTGTC^ 

TGGGATCCTTTOTGTTGGAGTTGAGTTATTGTGAGCCTGAAAGTACCCAGTTCCT^ 

ACAAACCATGAAGTGGCCTCTCTTAGGATCCAGGTCTTTTCCCATTTACTGAACTTATCATGAAAGTGAG 
TGCTACTACGAGGGGTCCAATC^CAGGCTGAGAAATTGTGTTACAGAATCTACTCTTGGAAG^ 
CGTGGCTGTCCTTTGGTACCTCGCTTTAAGGTGGCTTTCCCTTAGGACCCCTACTGTGGACTGCCTTATA 
ACTAAAACCTTTTGTATTTTAGTAACTGAATCCCCACT 

TAGATTAGAGCTTTAGAAGCTTCTAGAGCTTCTAAAGCCCGTGCTGGTGATCCCAGCGACTCTTCACTCC 
CTAGCCTTAGGTATTCCTAGAAGCCCTGACCAGTTGGCACTGCTGAGACTCCAGCCCCTGGGAGTGGTTT 
ACAGAAACMTACACAGACTCTGATGTCAGTCATC 

TGATGGATAATAGGGCGTGGK3TTCTGTCTGTTATGAGGGTC * 

CACTGAGCTGTGAGAAACAGTCACTCGGAAGTGTGAGCTTTATCTTAGTTTTTGTTGGATG^ 

CTGTCAGCTCCACAGGACTTC^GTACGTT^ 

CAAGCTGCAAGTGACACTCACCTCCTGCTGACAGTTGCAGTGTCTCAGATGGCCTGGAAGGGTGGTCTCC 
AGCAGCCTGCTGGGCGCTCCCCTTTCATGAGAGCC^COTGCAGTGACCTGAAOTGATACATGTTGATTAG 
TCTGCCCTTTCTTTAGAAAACTGCTACTCTCCTT^ 

CCAAACTCCATGTGACACTGATGAAGAGCCAGTGGGC^ CTAG CTT 

CCCACTGAA.GTCTGGCAGCGCTGGGGCATCAGCACACCTCTTGCCA 

GCTGTGAGGTGGCTGGGGGTTGAGACTTGAGGTTTCTAACTTTCCTCTGCACACCTGTGGCTACCTGGTG 
TTTGTCTCTTGATTCCCTCCACCTGCCTCA^ 

AAGGAACATAGGAGAGGGC^TGAAGGGGCTAGGCTGAAGC^CTCTGATGACTGGGGCCAATTTGTG 

AAAATG^TACATTTTTTGAAATTTATGGTCATTTTC^GTGATTTAGAAGGTTGATCCTTAGCCTC^TA 

CAGTGATGAAATAATCTGTGTGTTCAGAGCC^GC^ 

TCTCGGGGAAAAAAAAATACAAATACATTTCTCTGATCT 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Human GOSR2 mRNA sequence var4 (public gi: 16905519) (SEQ ID NO: 72) 

GTTCCGAGGAAGCCAGAGCCGGAGCCGTGGCCT 

GCAC^\AGCAGGTCCACGAGATCCAGTCTTGCATGGGACGCCTGGAGACGG 
ATAGTAGAAAACGAAATCCAAGCAAGCATAGACCAGATATTCAGCCGTCT 
CC^GCAAGGAGCCCCCTAACAAAAGGCAAAATGCCAGACTT 

GCAC CTGCAGACTGCGCTCAGAAACTT CCAGCATCGGCGCCATGCAAGGGAGCAG CAGGAGAGACAGCGA 

GAAGAGCTTCTGTCTCGAACCTTCACCACTAACGACTCTGACACCACC^ 

AGTTTAACTCCTCCCTCCAGAAAGTTCACAACGGCATGGATGACCT 

AGATGGACTGAGGACCCAGAGACTGACCTTGAAGGGGACTCAGAAG 

CTGGGCTTGTCCAACACAGTGATGCGGCTCATCGAGAAGCGGGC 

GCACCCAAGGATCCTGCCAGACAGCACACTTTGGAGGAAGGTCTGCAGTC 

TTGAACTCTGGGAGGCAGAAGTCCCCGCACCCATCATGCGTGGACTGATAG 

GACCAGTGCTTTCCACACTTGACAGTGGTTGGCT 

CTCTAGTTTGGAATAAAAATTGCAGAGGTGGAAAAAAAAAAAAAAAAAAA 

Human GOSR2 mRNA sequence - var5 (pubUc gi: 12711466) (SEQ ID NO: 73) 

AGCCGGAGC05TGGCCTGCGGGGCCGGCGACATGGATCCCCT 

GAGATCCAGTCTTGCATGGGACGCCTGGAGACGGCAGACAAGCAGTCTGTGCACATAGTAGAAAAC 

TCCAAGCAAGCATAGACCAGATATTCAGCCGTCTAGAACGTCTGGAGATTTTGTCCAGCAA 

TAACAAAAGGCAAAATGCCAGACTTCGGGTTGACCAGTTAAAGTATGA 

OTCAGAAACTTCCAGGATCGGCGCCATGCAAGGGAGCAGCAGGAGAG 

GAACCTTCACCACTAACGACTCTGACACCACCATAC 

CCAGAAAGTTCACAACGGCATGGATGACCTCATTTTAGATGG 

CAGAGACTGACCTTGAAGGGGACTCAGAAGAAGATCCTTGACATTGCCAAC^ 

CAGTGATGCGGCTCATCGAGAAGCGGGCTTTCCAGGACAAGTACTTTATGATO 

CCAGACAGCACACTTTGGAGGAAGGTCTGCAGGGAGGAGOTGAGCCATT 

AGAAGTCCCCGCACCCATCATGCGTGGACTGATAGGACATCTTTTCGTG^ 

ACTTGACAGTGGTTGGCTTTGATGAACCCTCATGCTGCACCTTCAGAGCCAGTCCTCTAGTTTGGAATAA 
AAATTGCAGAGGTGGAAAAAAAAAAAAAAAAAAA 

Human GOSR2 mRNA sequence - var6 (public gi: 37805253) (SEQ ID NO: 74) 

CAATAGAGACAAGGTCTTGCTCTGTCACCCAGGTTGGAGTACAGT 

TCTACCTCTTGGGTTCAAGCGATCCTCCCACOTCGGTCTTCTGAGTAGCTGGGAATACAGT 

TCAATATGTTCCCACTGACTGAGGAAAACAAGGATGTGGCCCAGTTGTTGCTCAATACTGGTACTTG 

AAGATGTATCTTCAGATTCTGTGGTGTGGATTTTCATGCACCTTACAAACTTCCATACAAGA 

CTGAGATACAGAGAGGTTAAGCAACCTCCCAAAGTTCTAGGGTTACAGGTGTTAGCCACTGTACCTGGCC 

TCTAAGGTGATTCTGATGTGTGTATTTTGGAACCACTGTCTCCTAGACAGAAAGCTTCTGTCTCAAAGAT 

GATCAGATTGGTGTAAAGAGCAAAACI^GTTAAGTCCAAAATAAATTCTTACTGTTTATA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human GOSR2 mRNA sequence - var7 (public gi: 1690552 1) (SEQ ID NO: 75) 

GTTCCGAGGAAGCCAGAGCCGGAGCCGTGGCCTGCCGGGCCGGCGACATGGATCCCCTGTTCCAGCAAAC 
GCACAAGCAGGTCCACGAGATCCAGTCTTGCATGGGACGCCTGGAGACGGCAGACAAGCAGTCTGTGCAC 
ATAGTAGAAAACGAAATCCAAGC^^GCATAGACCAGATATTCAGCCGTCTAGAACGTCTGGAGATTTTGT 
CCAGCAAGGAGCCCCCTAACAAAAGGCAAAATGCCAGACTTCGGGTTGA 

GCACCTGCAGACTGCGCTCAGAAACTTCC^GCATCGGCGCCATGCAAGGGAGCAGCAGGAGAGAmGCGA 
GAAGAGCTTCTGTCTCGAACCTTCA.CCACTAACGACTCTGACACCACCATACCAATGGACGAATCACTGC 
AGTTTAACTCCTCCCTCCAGAAAGTTCACAACGGCATGGATGAC 

AGATGGACTGAGGACCCAGAGACTGACCTTGAAGGGGACTCAGAAGAAGATCCTTGACATTGCCAACATG 
CTGGGCTTGTCCAACACAGTGATGCGGCTCATCGAGAAGCGGGCTTTCCAGGACAAGTACTTTATGATAG 
GTGGGATGCTGCTGACCTGTGTGGTCATGTTCCTCGTGGTGCAGTACCTGACATGAGCCAGCCACGCTCA 
GTGGCTGAACAGCATTCCCACAGCCTGCAAGTGTGTGTGTGTGTGAAAGAGAGAGGGGGGCCCAGAGGCC 
GCCTTTTGAAATGTTTGCCTGTCTGAACTGTGAAGACACTTGGGAGTGATTGTGGTCTAATTTCCAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Hiunan GOSR2 protein sequence - varl (public gi: 163 0724 1) (SEQ ID NO: 244) 

MDPLFQQTHKQVHEIQSCMGRLETADKQSVHI 

DQLKYDVQHIKJTALRNFQHRRHAREQQERQREEliLSRTFTTNDSDTT I PMDES hQ FNSSLQKVHNGMDDL 
ILDGHNI LDGLRTQRLTLKGTQKKI IjDI ANMIiGLSNTVMRL I EKRAFQDKYFMI GGMLLTCWMFIiWQ Y 
LT 
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Human GOSR2 protein sequence - var2 (public gi: 16905522) (SEQ ID NO: 245) 

MDPL FQQTHKQVHE I QS CMGRLETADKQSVH I VENE I QAS I DQI FSRLERLE I DS S KEP PNKRQNARLRV 

IX)LKYDVQHIjQTALRNFQHRRHARE(X3ERQ 

IUX^IIiDGIiRTQRKrii^ 

Hitman GOSR2 protein sequence - var3 (public gi: 127H467) (SEQ ID NO: 246) 

MDPLFQQTHKQVHEIQSCMG^ 

DQLKYDVQHLQTAIjRNFQHRRHAREQQERQREELLSRTFTT^ 

I LDGHNT IiDGLRTQRIjTIjKGTQKKI LJD I ANMLGLSNTVMRXiI E KRAFQDK^ I GTQGS CQTAHFGGRS A 
GSS 1 

Human GOSR2 protein sequence - var4 (public gi: 21961349) (SEQ ID NO: 247) 

■MDPLFQQTHKQVHEIQSCMGR 

DQLKYDVQHLQTAIiRNFQHRRHAREQQERQREEIiIjSRTFTTNDSDTTIP 
IIiDGHNIIJDGI^TQRLTI,KVGSLnGDREK^CFSLIQQFSNCVyiLITCPQIVIF 

Human GOSR2 protein sequence - var5 (public gi: 23 ieo 8 8) (SEQ ID NO: 248) 

MDPLFQQTHKQVHE I QSCMGRLETADKQS VHI VENE IQAS IDQI FSRLERLE I LSSKEPPNKRQNARLRV 
DQLKYDVQHLQTALRNFQHRRHAREQQERQR^ 

IIjDGHNIIjDGLRTQRXiTIjKGTQKKI pdianmlglsntvmrliekrafqdkyfmiggmlltcvvmflvvqy 

IjT 

Human GOSR2 pray sequence - varl (SEQ ID NO: 76) 
agcgccgccatggnagtaccc^tncgacgt^^ 

cacccaagcagtggtatcaacgcagagtggccattatggccggaaaccggaaggggggctgtgaggacgt 
gttccgaggaagccagacccggagccgtggcctgccgggccggcgacatggatcccctgttccagcaaac 
gcacaagcaggtccacgagatccagtcttgcatgggacgcctggagacggcagacaagcagtctgtgcac 

atmtagaaaacgaaatco^caagc^^ 

ccagcaaggagccccctaacaaaaggcaaaatgccaaacttcgggttgaccagttaaagtatgat 
gcacctgcagactgcgctcagaaacttccagcatcggcgcnatgcaagggagcagcgggagagacagcga 

GAAGANCTTNTGTCTOTAACCTTAACCNNTACC^ANTTTGACNCCCCCTTO 

NGTTAACNTNCCTCCCNCNAAAAAGTTACAAACGGCTTGNNNAACT . 
TNAATNGCNTTGGGNNCCCAAAACCTTCCTTTNGNGGGGGGGNCCNTTTGGGGGGAAAAAAAAAANGCCC 

TTTTTTTTANCCCOtfNNNCAAI^TTO^ 
TTTTTsTNNAAANAAAAACNCCCCCCTNTGGGGGGGCCT 
-ACCCQnTTTNTNNGGimGGAAAAAAAAAA 

Human HERPUD 1 mRNA sequence - varl (public gi: igsotsoi) (SEQ ID NO: 77) 

AGAGACGTGAACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCA 

GCGGAGCCCCGACACCGCCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCTCCTGGTG 

AAGAGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCA 

AGGCCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAA 

GCTGTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGGAAAAACGGCATGTTTTGCATCT 

TGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGC 

CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCG 
GAACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCATCACGTTGGGTGGTTTCCATTTAGACCGAGG 
CCGGTTCAGAACTTCCCAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAACTTAC 
AGGAAGGCACTGATCCTGAAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGA 
GCAGACCAGCCCCTCCTTTATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCA 

GAAGGCCCCCCAGCCATCGCAAACTGATGGTGTTTGTC 

CTGGATCACCTGACTCCAGCTAGATTGCCTCTCCTGGAGATGGCAATGATGAGTTTTTAAAAAACAG 
GGATGATGATATGCTTTTGTGAGCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAA 
AAATGCCCAAGGCTTCTCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCA 
CTGTACGTAGAAGGCCTTAGGTGTTGCATGTCTATGCTT 

GTGTGTTTGTACATAGAAGTCATAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATG 

TTTAAATTACACTAAGTGTACTACTTTATA^ 

CTAGGAAAGACTTATGTATAATTGCTTTTTAA 

GAGGTGAAAACCTTTGCTGGGTTTTCTGTTC^TAAAGTTTTACTATGAATGACCCTGAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAATVAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Human HERPUD1 mRNA sequence - var2 (public gi: 10441910) (SEQ ID NO: 78) 

GCTGTGTGGCCCAGGCTTTTCT 

TGGGACTACAGGCATGTGCCACTAGACCTGGCTCT^ 
CATTTCACAATGTTTATTCACATATATGGTATTAGTATTCTAATGTA 
ATATTTCCTAGAACATCTGAACAGAGCATAGGAAATTCCCTATTTTO 
CTTAAAAGCACTTTATCATTTCATTTCCCTGCA^ 

C^GGCTTACTTATATTGGAATACTATTTTAGAAAGTTGTGGGCTGGGTTGTATTTAT 
TCAGATGTCTGGAATGAGTAAATTTAGCACCATTATCAGGAAGCTTTCT 

AAGATTTTAATGAAAGTGTAGCATACTCTAGGGAAAAAATATGAATATTTTAGGATCTATGTA 
TTATGTTGAATAAATGTCAGACTATTTTTTACATAACGTTGCTTCTGCT 
TGGGGGGTAGGAGATGTAAGCCCTTGAC^GCAAAATAATTCCTTTTGCTTGAT^ 
GCTCCTTTGTTCTGTGTTCATGTTACAC^ 

ATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTC1TCGGAACC 
TTCCCCTGGATGGGAAAACATCTCAAGGCCIH3AAGCT 

TTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTTTCCTGGTTCCAGCAGATATATGCACGACA 
ACATGCAATATTTAGCAGCCACTGCTGGAT 

TGTGGTCTCTGCACCTGCTCC^GCCCCTATTCACAACCAGTTTCC^GCTGAAAACCAGCCTG 

AATGCTGCTCCTCAAGTGGTTGTTAATCCTGGAGCCAATCAAAATTTGCGGATGAATGC^ 

CTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGAC 

TGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCTGAGCAGATTCCTCATGGTCATGG 

GTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTTCAGAACTTCCGAAATGA 

GTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAACTTACAGGAAGGCACTGATCCT 

AGACCCC^CCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGA 

ACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCT 

GATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAAT 

TGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAACAGTGTGGA 

AGCAAAAGGAGAAACGTGAAG CCGTGATACAAATTGGTGAAGAAAAAATGCCCAAG CTTCTCATGT CTT 
TATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAGGCCTTAGGTGTT 
GCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACATAGAAGTCATAG 
ATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAGTGTACTACT 
TTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTTATGTATAATTGC 
TTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCTTTGCTGGGTTTT 
CTGTTGAATAAAGTTTTACTATGAATGACAAAAAAAAAAAAAAAAAA 

Human HERPUD1 mRNA sequence - var3 (public gi: 3005722) (SEQ ID NO: 79) 

GGCCACCTCAAGGCCCACCTGAGCCGCGTCTACCCCGAGCGTCC^CGTCCAGAGGACCAGAGGTTAATTT 
ATTCTGGGAAGCTGTTGTTGGATCACCAATGTCTC^ 

GCATCTGGTGTGCAATGTGAAGAGTCCTTCAAAAATGCCZAGAAATCAACGCCAAGGTGGCTGAATCG^ 

GAGGAGCCTGCTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGG 

AAGTTCTTCGGAACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATT 

CCAAGGCCTGGGTCCTGGTTTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTTTCCTGGTTCC^ 

ATATATGCACGAGAGTACTACATGCAATATTTAGCAGC(^ 

CAAGTGCACAAGAGATACCTGTGGTCTCTGGACCTGCTCCAGCCC^ 

AAACCAGCCTGCCAATCAGAATGCTGCTCCTCAAGTGGTTGTTAATCCTGGAGCCAATCAAAATT 

ATGAATGCACAAGGTGGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGACCT 

ATTGAGCAGCTACATTTTCTGTTTTTCTCAGTATCCTC^^ 

GGTCATGGGGGCCACCGTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTT 
CAGAACTTGCCAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAACTTACyVGGAAG 
GCACTGATCCTGAAACTGAAGACCCCAACCACCTCCCTCGAGACAGGGATGTACTAGATGGCGAGCAGAC 
CAGCCCCTCCTTTATGAGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCAGAAGGC 
CCCCCAGCCATCGCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGAT 
CACCTGACTCC^GCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAACAGTGTGGATGA 
TGATATGCTTTTGTGAGCAAGCAAAAGCAGAAACGTGAAGC^ 

CCAAGGCTTCTCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTAC 

GTAGAAGGCCTTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGT 

TTGTAGATAGAAGTCATAGATGGAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAA 

TTACACTAAGTGTACTACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGA 

AAGACTTATGTATAATTGCTTTTTAAAATGC^GTGCTTTACTTTAAAOTAAGGGGAACTOT 

AAAACCTTTGCTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTGAAAAAAAAAAAAAAA7^AA 

AAAA 
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Human HERPUD 1 mRNA sequence - var4 (public gi: 21619175) (SEQ ID NO: 80) 

CCACGCX5TCCGGGTCX3TTGC^ 

CGGAGCCCCGACACCGCCX5CCGCCGCCATGGAGTCCGAGACCGAA 

AGAGCCCCAACCAGCGCCACCX^GACTTGGAGCTGAGTGGCGACCGCGGCTGGA^ 

GGCCCACCI^GCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAG^ 

CTGTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGC^^ 
TGTGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGG 

TGCTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGOT 

CGGAACCTTTCTTCCCCTCGATGGGAAAACATCTCAAGGCCTGAAGC 

TGGGTCCTGGTTTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTCT 

acgacagtactacatgcaatatttagcagccactgctgcatcaggggcttt^ 

caagagatacctgtggtctctgcacctgctccagcccctatt 

ctgccaatcagaatgctgctcctcaagtggttgttaatcct 

acaaggtggccctattgtggaagaagatgatgaaataaat^ 

gctacattttctgtttttctcagtatc 

gggccaccgttgttatgtacctgcatcacxsttgggtggtttccatt^ 

cccaaatgatggtcctcctcctgacgttgtaaatc^ 

cctgaaactgaagaccccaaccacctccctccagacagggatgtactagatggcgagcagaccagcccct 
cctttatgaggacagcatggcttgtcttcaagactttctttgcctctcttcttccagaaggccccc 

CATCGCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCACCTGAC 
TCCAGCTAGATTGCCTCTCCTGGACATGGC^TGATGAGTTTTTAAAAAACAGTGTGGATGATGATATGC 
TTTTGTGAGCAAGCAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAAAAATGC 
CTCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAGGC 

CTTAGGTGTTGCATGTCTATGCTTGAGGAACT 

GAAGTCATAGATGCAGAAGTGGTTCTGCTGGTACGATT^GATTCCTGTTGGAATGTTTAAATTACACTAA 
GTGTACTACTTTATATAATCAATGAAATTGCTAGACATGT^ 

GTATAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCTTT 
GCTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTGAAAAAAAAAAAAAAA 

Human HERPUD 1 mRNA sequence - var5 (public gi: 14249882) (SEQ ID NO: 81) 

AACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTG 

CGACACCGCCGCGGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCTCCTGGTGAAGAGCCCC 

AACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCAAGGCCCACC 

TGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAAGCTGTTGTT 

GGATCACCAATGTCTCAGGGACTTGCTTCCAAAGCAGGAAAA 

GTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCGAAGGTGGCT 

CTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCGGAACCT 
TTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATTCCAAGGCCTGGGTCCT 
GGTTTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTTTCCTGGTTCCAGCAGATATATGCACGACAGT 
ACTACATGCAATATTTAGCAGCGACTGCTGCATCAGGGGCTTTTC 

ACCTGTGGTCTCTGCACCTGCTCCAGCCCCTATTCACAACCAGTTTCCAGCTGAAAACCAGCCTGCCAA 
CAGAATGCTGCTCCTCAAGTGGTTGTTAATCCTGGAGCCAATCAAAATTTGCGGA 

GCCCTATTGTGGAAGAAGATGATG7VAATAAATCGAGATTGGTTGGATTGGACCTATTCAGCAGCTACATT 
TTCTGTTTOTCTCAGTATCCTCTACTTCTACTCCTCCCTGAGCAGATTCCTCATGGTCATGGGGGCCACC 
GTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTTCAGAACTTCCCAAATG 
ATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCT^CAATAACTTACAGGAAGGCACTGATCCTGAAAC 
TGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGACCAGCCCCTCCTTTATG 
AGCACAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCAGAAGGCCCCCCAGCCATCGG^ 
ACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCACCrGACTCCAGCTA 
GATTGCCTCTCCTGGACATGGGAATGATGAGTTTTTAAAAAAGAGTGTGGATGATGATATGCTTTTGTGA 
GCAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGT 

CTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAGGCCTTAGGT 
GTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGGATGTGTGTTTGTACATAGAAGTCA 
X TAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAGTGTACT 
AOTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTTATGTA 
TGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCTTTGCTGGGT 
TTTCTGTTCAATAAAGTTTTACTATGAAAAAAAAAAAAAAAAAA 

Human HERPUD 1 mRNA sequence - var6 (public gi: 12652674) (SEQ ID NO: 82) 

GAACTGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCAGCGGAGCC 
CCGACAC CGCCGCCGCCGC CATGGAGTCCGAGACCGAAC CCGAGCCCGTCACGCTCCTGGTGAAGAGCCC 
CAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCAAGGCCCAC 
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CTGAGCCGCGTCTACCCCX3AGCXjTCCGCGTCCAGAGGACCAGAGGTTAA 

TGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGCAGGAAAAACGGCATGT^ 

TGTGAAGAGTCCTTGAAA^TGCCAGAAAT(^ 

TCTAATCXKSGGACAGTATCCTGAGGATTCCTC^ 

TTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTG 

TGGTTTCTCCGGTTACACACCCTATGGGTGGC^ 

TACTACATGCAATATTTAGCAGCCACTGCTGCAT 

TACCTGTGGTCTCTGCACCTGCTCCAGCCCCTATTCAC^CCAGTTTCCA 
TCAGAATGCTGCTCCTCAAGTGGTTGTTAATCCIX3GAGCCAATCAAA 
GGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGACCT 
TTTCTGTTTTTCTCAGTATCCTCTACTTCTACT 

CGTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCX3AGGCC 
GATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCGAACAA 

CTGAAGACCCCAACCACCTCCCT CCAGACAGGGATGT ACTAGATGGCGAGCAGACCAGC C CCTCCTTTAT 
GAG(^CAGCATGGCTTGTCTTGAAGACTTTCT 

AACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCACCTG 

AGATTGCCTCTCCTGGAGATGGGAATGATGAGTTTTTAAAAAAG^ 

AGCAAGCAAAAGCAGAAA05TGAAGCCGTGATACAAATTGGTGAACAAAA 

TCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTT 

TGTTGCATGTCTATGCTTGAGGAACTTTTCC^J^^ 

ATAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAGTGTAC 
TACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTTATGTATAA 
TTGCTTTTTAAAATGCAGTGCTTTACTTTA/^ 

TTTTCTGTTCAATAAAGTTTTACTATGAATGAAAAAAAAAAAAAAAAAAAA 

Human HERPUD1 mKNA sequence - var7 (public gi: 9711684) (SEQ ID NO: 83) 

agagacgtgaactgtcgttgcagagattgcgggcggctgagacgccgcctgcctggcacctaggagcgca 

gcggagccccgacaccgccgccgccgccatggagtccgagaccgaacccgagcccgtcacgctccrggtg 

aagagccccaaccagcgccaccgcgacttggagctgagtggcgaccgcggctggagtgtgggccacctca 

aggcccacctgagccgcgtctaccccgagcgtccgcgtccagaggaccagaggttaatttattctgggaa 

gctgttgttggatcaccaatgtctc^gggacttgct^ 

gtgtgc^tgtgaagagtccttcaaaaatgccagaaatcaacgccaaggtggctgaatccacagaggagc 

ctgctggttctaatcggggacagtatcctgaggattcctcaagtgatggtttaaggcaaagggaagttct 

tcggaacctttcttcccctggatgggaaaacatctgaaggcctgaagctgcccagca 

ctgggtcctggtttctccggttacac^ccctatgggtg 

cacgacagtactacatgcaatatttagcagccactgc 

ac^gagatacctgtggtctctgcacctgctc<^^ 

cctgccaatcagaatgctgctcctcaagtggttgttaatcctggagcgaatcaaaatttgcggatga 
cac^ggtggccctattgtggaagaagatgatgaaataaatcgagattggttggattggacctattcagc 

AGCTACATTTTCTGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCTGAGCAGATTCCTCATGGTCATG 

GGGGCCACCGTTGTTATGTACCTGC^TGACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTTCAGAACT 

TCCCAAATGATGGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAAC^^ 

TCCTGAAACTGAAGACCCG^CCACCTCCCTCCAGACAGGG^^ 

TCCTTTATGAGCACAGCATGGCTTGTCTTCAAGACT 

CCATCGCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCACCTGA 

CTCCAGOTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAACAGTGTGGATGATGATATG 

CTTTTGTGAGGAAGCAAAAGCAGAAACGTGAAGCCGTGATACAAATTGGTGAACAAAAAATGCCCA 

TTCTCATGTCTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAG 

GCCTTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACA 

TAGAAGTCATAGATGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACT 

i^GTGTACTACTTTATATAATC^TGAAATTGCTAGA(^TGTTTTAGC7VGGACTTTTCTAGGAAAGACTT 

ATGTATAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCT 

TTGCTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTG 

Human HERPUD1 mRNA sequence - var8 (public gi: 3005718) (SEQ ID NO: 84) 

GACGTGAACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCAGCG 
GAGCCCCGACACCGCCGCCGCCGCCATGGAGTCCGAGACCGAACCCGAGCCCGTCACGCTCCTGGTGAAG 
AGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCAAGG 
CCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAAGCT 
GTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGC^GGAAAAACGGCATGTTTTGCATCTGGTG 
TGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAG 

CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCG 
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r ACArTACTA^ATGCAA^ 
GCCAATCAGAATGCTG^TCOT 

• a^gg^ccta-^ 

^CA^CTGT^CTCAGTATC^ 

gccaSotto?™™^ 

CAAATGATCGTCCTCCTCCTGACGTTGTAAATCAGGACCCCAACAATAA 

TGARflCTGAAGACCcSACCACCTCCCTCOVGACAGGGATGTACT 

TTTATGAGCACAGCATGGCTTGTCTTCAAGACTTTCITTGCCTCTCT 

Sg^a^^ctcc^catggc^tgatgagtttttaaaaaacagtgtg^ 
SgtcaSSgcaaaagcagaaacgtgaagccgtgatacaaatt^ 

ttaggtgttccatgtctatgcttgaggaacttttccaaatgtgtg 
IagISaSaS^ 

tgtactactttatataatcaatgaaattgctagacatgtt 

ctgggttttctgttcaataaagttttactatgaatgaccctgaaaaaaaaaaaaaaaaaaaaaa 




AGATACCTGTGGTCTCTGCACCTGCTCC^GCCCCTATTCA<^CCAGTTTC(^GCTGAAAACCAGCCTGC 

CAATCAGAATGCTGCTCCTC^GTGGTTGTTAATCCTGGAGCCAATCAAAATTTGCGGAT^ 

GOTGGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGACCTATTCAGCAGCTA 

^™ctctSttctcagtatcctctacttctactcctccctgagcagattcc 

^CGTTGTTATGTACCTGCATCACGTTGGGTGGTTTCCATTTAGACCGAGGCCGGTTCAGAAC^CC^ 

SSgatggtcctcctcotgacgttgtaaatcaggaccccaacaataacttacagga^^ 

AAACTGAAGACCCCAACCACCTCCCTCCAGACAGGGATGTACTAGATGGCXSAGCAGACGAGCCCCTCC^ 

SgIgcaSg^^ct^ 

GCAAACTCATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGATCA 

gc^at?gcctctcctggacatggcaatga^ 

gtgagcaagcaaaagcagaaacgtgaagccgtgatacaaattggtgaacaaaa^ 

ATGTGTTTATTCTGAAGAGCTTTAATATATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAGGCCT^ 

AGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCTG<^TGTGTGTTTGTACATAGAA 

GTCATAGATGCAGAAGTGGTTCTGCTGGTAAGATTTGATTCCTGTTGGAATGTTTAAATTACACTAA 

TACTACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGAAAGACTTATGTA 

TAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTTTGCGGAGGTGAAAACCTTTGCT 

GGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCCTG 

Human HERPUD1 raRNA sequence - varlO (public gi: 7661869) (SEQ ED NO: 86) 

GACGTGAACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAGGAGCGCAGCG 
C^GCCCCGACACCGCCGCCGCCGCCATGGAGTCCG^C^CCCGAGCCCGTC^CGCTCC^TGAAG 
AGCCCC^CCAGCGCCACCGCGACTTGGAGCTGAGTGGCGACCGCGGCTGGAGTGTGGGCCACCTCAAGG 
CCCACCTGAGCCGCGTCTACCCCGAGCGTCCGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAAGCT 
GTTGTTGGATCACCAATGTCTCAGGGACTTGCTTCCAAAGCAGGAAAAACGGCATGTTTTGCATCTGGTG 
TGCAATGTGAAGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATCCACAGAGGAGCCTG 
CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGGAAGTTCTTCG 
GAACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCCAGCAGGCATTCCAAGGCCTG 
GOTCCTGGTTTCTCCGGTTACACACCCTATGGGTGGCTTCAGCTTTCCTGGTTCCAGOVGATATATGCAC 
GACAGTACTACATGCAATATTTAGCAGCCACTGCTGCATCAGGGGCTTTTGTTCCACCACCAAGTGCACA 
AGAGATACCTGTGGTCTCTGCACCTGCTCCAGCCCCTATTCACAACCAGTTTCCAGCTGAAAACCAGCCT 
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GCCAATCAGAJVFGCTGCTCCTCAAGTGGTTGTT 

AAGGTGGCCCTATTCTGGAAGAAGATGATGAAATAAATCGAGATTGGTTG 

TAGATTTTCTGTTTTTCTCAGTATCCTCT 

GCCACCGTTGTTATGTACCTGCATCACGTTGGGTC^ 

CAAATGATGGTCCTCCTCCTGACGTTGTAAATC^GGACCCCAACAATAACTTACAG 

TGAAACTGLAAC1ACCCCAACCACCTCCCTC 

TTTATGAGCACAGCATGGCTTGTCTTCAAGACTTTCT^ 

TCGCAAACTGATGGTGTTTGTGCTGTAGCT 

CAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTT^ 

TTGTGAGC^GC^AAAGCAGAAACGTGAAGCCGTC^^ 

TCATGTCTTTATTCTGAAGAGCTITAATAT^^ 

TTAGGTGTTGCATGTCTATGCTTGAGGAACTTTTCCAAATGTGTGTGTCT 

AAGTCATAGATGCAGAAGTGG1TCTGCTGGTACGATTTGATTCCTGTTC 

TGTACTACTTTATATAATCAATGAAATTGCTAG^ 

TATAATTGCTTTTTAAAATGCAGTGCTTTACT 

CTGGGTTTTCTGTTCAATAAAGTTTTACTAT 

Human HERPUD1 Protein sequence - varl (public gi: 16507802) (SEQ ID NO: 249) 

MESETEPEPVTLLVKSPNQRHRDLEI>SGDRGWSVGHLKAHLSRVYPERPR 

DLLPKEKRHVLHLVCNVKSPSKMPEINA^ - 

ISRHHVGWFPFRPRPVQNFPNDGPPPDVVNQDPNNNIjQEGTDPETEDPNHIjPPDRDVIjDG 

AWIiVFKTFFASLLPEGPPAIAN 

Human HERPUD1 Protein sequence - var2 (public gi: 10441911) (SEQ ID NO: 250) 

MQYIiAATAASGAFTOPPSAQEIPWSAPAPAPIHNQFPAENQP 
IVEEDDEINRDWLDWTYSAATFSVFLSILYFYSSLSRFI^^ 

PPPDVWQDPNNNLQEGTDPETEDPNHLPPDRDVLDGEQTSPSFMSTAWLVFKTFFASIjLPEGPPAIAN 

Human HERPUD1 Protein sequence - var3 (public gi: 3005723) (SEQ ID NO: 251) 
ghlkahlsrwperprpedqrliysgkllijdhqc^ 

EEPAGSNRGQYPEDSSSDGIjRQREVLRNLSSPGWENISRPEAAQQAFQGLGPGFSGYTPYGWLQLSWFQQ 

I YARQYYMQ YIiAATAASGAFVP PPSAQE I P WS AP APAP I HNQFP AENQP ANQNAAPQ VWNPGANQNLR 

MNAQGGP:i^EDDEINRDWIiDWTYSAATFS 

QNFPNDGPPPDVVNQDPNNNLQEGTDPETEDPNHLPPDRD^ 

PPAIAN 

Human HERPUD1 Protein sequence - var4 (public gi: 7661870) (SEQ ID NO: 252) 

MESETEPEPVTI^VKSPNQRHRDLELSGDRGWSVGHLBCAHLSRVYPERPRPEDQRLIYSGKLLIiDHQCI^ 

DliLPKQEKRHVLHLVCNVKS PSKMPE XNAKVAE STEEP AGSNRGQYPEDSS SDGLRQREVLRNLSS PGWE 

NISRPEAAMAFQGLGPGFSGYTPYGWLQLSWFQQIYARQYYMQYLAATAASGAFVPPPSAQEIPVVSAP 

APAPIHNQFPAENQPANQNAAPQVVVNPGANQNLRMNAQGGPIVEEDDEINRDW 

LYFYSSLSRFLM\WGATVVMYLHHVGWFPFRPRPVQNF 

PPDRDVIiDGEQTS PS FMSTAWLVFKTFFASLLPEGPPAI AN 



Unigene Name : HLA-A Unigene ID : Hs . 181244 Clone ID : GD_159 
Human HLA-A niRNA sequence - varl (public gi: 575248) (SEQ ID NO: 87) 

ATGGCCGTCATGGCGCCCCGAACCCTCGTCCTGCTACTCTCGGGGGCTCTGGCCCTGACCCAGACCTGGG 
CGGGCTCTCACTCCATGAGGTATTTCTTCACATCCGTGTCCCGGCCCGGCCGCGGGGAGCCCCGCTTCAT 
CGCAGTGGGCTACGTGGACGACACGCAGTTCGTGCGGTTCGACAGCGACGCCGCGAGCCAGAGGATGGAG 
CCGCGGGCGCCGTGGATAGAGCAGGAGGGTCCGGAGTATTGGGACGGGGAGACACGGAAAGTGAAGGCCC 
ACTCACAGACTCACCGAGTGGACCTGGGGACCCTGCGCGGCTACTACAACCAGAGCGAGGCCGGTTCTCA 
CACCGTCCAGAGGATGTATGGCTGCGACGTGGGGTCGGACTGGCGCTTCCTCCGCGGGTACCACCAGTAC 
GCCTACGACGGCAAGGATTACATCGCCCTGAAAGAGGACCTGCGCTCTTGGACCGCGGCGGACATGGCAG 
CTCAGACCACCAAGCACAAGTGGGAGGCGGCCCATGTGGCGGAGCAGTTGAGAGCCTACCTGGAGGGCGA 
GTGCGTGGAGTGGCTCCGCAGATACCTGGAGAACGGGAAGGAGACGCTGCAGCGCACGGACGCCCCCAAA 
ACGCATATGACTCACCACGCTGTCTCTGACCATGAAGCCACCCTGAGGTGCTGGGCCCTGAGCTTCTACC 
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CTG CGG AGATCACACTGAC CTGGCAG CGGGATGGGGAGG AC CAGAC CCAGGACACGGAGCTCGTGG AGAC 
CAGGC CTGC AGGGGATGGAAC CTTC C AGAAGTGGGCGGCTGTGGTGGTGC CTTCTGGACAGGAGC^ 
TACACCTGCCATGTGCAGCATGAGGGTTTGCCCAAGCCCCTCACCCTGAGATGG^ 
CCACGATCCCGATCGTGGGCATCATTGCTGGC^ 

CGCTGCTGTGATGTGGAGGAGGAAGAGCTCAGATAGAAAAGGAGGGAGCTACT 
GACAGTGCCCAGGGCTCTGATGTGTCTCTCACAGCTTGTAAAGTGTGA 

Human HLA-A mRNA sequence - var2 (public gi: 187857) (SEQ ID NO: 88) 

ATGGCCGTCATGGCGCCCCGAACCCTCGTCCTGCTACTCTCGGGGGCCCTGGCCCTGACCCAGACCTGGG 
CGGGCTCCCACTCCATGAGGTATTTCTACACTTCCGTGTCCCGGCCCGGCCGCGGGGAGCCCCGCTTCIAT 
CGCCGTGGGCTACGTGGACGACACGCAGTTCGTGCGGTTCGACAGCGACGCCGCGAGCCAGAGGATGGAG 
CCGCGGGCGCCGTGGATAGAGCAGGAGGGGCCGGAGTATTGGGACCGGAACACACGGAATGTGAAGGCCC 
AGTC^CAGACTGACCGAGTGGACCTGGGGACCCTGCGCGGCTACTACAACCAGAGCGAGGCCGGTTCTCA 
CACCATCCAGATGATGTATGGCTGCGACGTGGGGTCGGACGGGCGCTTCCTCCGCGGGTACCGGCAGGAC 
GCCTACGACGGGAAGGATTACATCGCCCTGAAAGAGGACCTGCGCTCTTGGACCGCGGCGGACATGGCAG 
CTCAGACCACCAAGCACAAGTGGGAGGCGGCCCATGTGGCGGAGCAGTGGAGAGCCTACCTGGAGGGCAC 
GTGCGTGGAGTGGCTCCGCAGATACCTGGAGAACGGGAAGGAGACGCTGCAGCGCACGGACGCCCCCAAA 
ACGCATATGACTCACCACGCTGTCTCTGACCATGAAGCCACCCTGAGGTGCTGGGCCCTGAGCTTCTACC 
CTGCGGAGATCACACTGACCTGGCAGCGGGATGGGGAGGACCAGACCCAGGACACGGAGCTCGTGGAGAC 
CAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGTGGCTGTGGTGGTGCCTTCTGGACAGGAGCAGAGA 
TACACCTGCCATGTGCAGCATGAGGGTTTGCCCAAGCCCCTCACCCTGAGATGGGAGCCGTCTTCCCAGC 
CCACCATCCCCATCGTGGGCATCATTGCTGGCCTGGTTCTCTTTGGAGCTGTGATCACTGGAGCTGTGGT 
CGCTGCTGTGATGTGGAGGAGGAAGAGCTCAGATAGAAAAGGAGGGAGCTACTCTCAGGCTGCAAGCAGT 
GACAGTGCCCAGGGCTCTGATGTGTCTCTCACAGCTTGTAAAGTGTGA 

Human HL A- A protein sequence - varl (public gi: 5 1 s 2 4 9) (SEQ ID NO: 253) 

MAVMAPRTIiVLIiLSGAIiALTQTWAGSHSMRYFFTSVSRPGRGEPRFIAVGYVDDT 
PRAPWIEQEGPEYWDGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHT^ 
AYDGKDYIALKEDLRSWTAADMAAQTTKHKW 
THMTHHAVSDHKATIiRCWALSFYPAEITL 

YTCHVQHEGLPKPLTLRWEPS SQPT I P IVGI IAGLVLFGAVITGAWAAVMWRRKS SDRKGGS YSQAASS 
DSAQGSDVSLTACKV 
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Unigene Name : HLA-B Unigene ID : Hs. 77961 Clone ID : 3GD_1122 
Human HLA-B mRNA sequence - varl (public gi: 3 2 18 8) (SEQ ID NO: 89) 

ATGCGGGTCACGGCGCCCCGAACCGTCCTCCTGCTGCTCTC^ 
CCGGCTCCCACTCCATGAGGTATTTCTACA^ 

CTCAGTGGGCTACGTGGACGACACGCAGTTCGTGAGGTTCGAGAGCGACGCCGCGAGTCCGAGAGAGGAG 
CCGCGGGCGCCGTGGATAGAGCAGGAGGGGCCGGAGTATTGGGACCGOT^ 

AGACACAGACTTACCGAGAGAGCCTGCGGAACCTGCGCGGCTACTACAACCAGAGCGAGGCCGGGTCTCA 
.CACCCTCCAGAGGATGTACGGCTGCGACGTGGGGCCGGACGGGCGCCTCCTCCGCGGGCATGACCAGTCC 
GCCTACGACGGCAAGGATTACATCGCCCTGAACGAGGACCTGAGCTCCTGGACCGCGGCGGACACGGCGG 
CTCAGATCACCCAGCGCAAGTGGGAGGCGGCCCGTGAGGCGGAGCAGCTGAGAGCCTACCTGGAGGGCCT 
GTGCGTGGAGTGGCTCCGCAGATACCTGGAGAACGGGAAGGAGACGCTGCAGCGCGCGGACCCCCCAAAG 
ACACATGTGACCCACCACCCCATCTCTGACCATGAGGCCACCC^ 
CTGCGGAGATCACACTGACCTGGCAGCGGGATGGCGAGGACGAAACT 
CAGACCAGCAGGAGATAGAACCTTCCAGAAGTGGGCAGC 
TACACATGCCATGTACAGCATGAGGGGCTGCCG^^ 
CCACCATCCCCATCGTGGGCATTGTTGCTGGCCTG^ 

CGCTACTGTGATGTGTAGGAGGAAGAGCTCAGGTGGAAAAGGAGGGAGCTACTCTCAGGCTGCGTCCAGC 
GACAGTGCCCAGGGCTCTGATGTGTCTCTCACAGCTTGA 



Human HLA-B protein sequence - varl (public gi: 3 2 18 9) (SEQ ID NO: 254) 

MRVTAPRTVIiLIiLSGAIiALTETWAGSHSMRYFYTAMSRPGRGE 

PRAPW I EQEGPEYVTORETQ IS KTNTQTYRE SLRNLRGYYNQS EAG SHTLQRMYGCDVGPDGRLLRGHDQ S 

AYDGKDYIALNEDLSSWTAADTAAQITQRK^^ 

THVTHHPISDHEATLRCWALGFYPAEITLTWQ 

YTCHVQHEGLPKPLTLRWEPSSQSTIPIVGIVA^ 

DSAQGSDVSLTA 

Unigene Name : MSTP028 Unigene ID : Hs. 302746 Clone ID : GD_1119 

Human MSTP028 mKNA sequence - varl (public gi: 14042294) (SEQ ID NO: 90) 

CCCCGCCTCCGCCCCCGGCTGGCGTGAGCTGGGTGTTTCCTGCCTCTCTCAGTCCGGGTTTGGAGACTCC 
TGCGTCCTCCGACTTTTCGTGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGGTGCCAGCGGCTG 
CTACCCGCACCACTTCCTTCAAGGGCACGAGCCCCAGCTCCAAATACGT 

CCTCTACTATACCACCATGCAGACGCTGACCAAGCAGGACACCATGCTGAAGGCCATGTTCAGCGGGCGC 

ATGGAAGTGCTCACCGACAGTGAAGGCTGGATCCTCATTGACCGCTGTGGGAAGGACTTTGGTACGATAC 

TCAACTACCTTCGAGACGGGGCGGTGCCTTTACCCGAGAGCCGCCGGGAGATCGAGGAGCTGCTAGCAGA 

AGCCAAGTACTACCTAGTCCAAGGCCTGGTGGAAGAGTGCCAGGCGGCC^ 

GAGCCTTTCTGCAAGGTCCCTGTGATCACCTCATCCAAGGAAGAACAA 

AGC CAGCCGTGAAGTTGCTCTACAACAGAAGTAACAACAAAT ACT CATATACCAG CAATTCTGACGACAA 
TATGTTGAAAAACATTGAACTGTTTGATAAGCTGTCTCTGCGCTTTAACGGAAGGGTCCTGTTCATAAAG 
GATGTTATTGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTGAGGGCCGGAAGATTGCTGAAGTCTGTT 
GTACCTCCATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAGTTTCCCGAAGCCCGGATTTATGA 
GGAGACCCTGAACATTTTGCTGTATGAGGCCCAGGATGGCCGGGGACCTGACAATGCGCTCCTGGAGGCC 
ACAGGCGGGGCGGCGGGGCGCTCCCACCACCTGGACGAGGACGAGGAGCGGGAGCGGATCGAGCGCGTGC 
GGAGGATCCACATCAAGCGCCCTGATGACCGGGCCCACCTCCACCAGTGAGCAGGCAAGAGACCGAGCCG 
CCCTCCTCTC^CCGCCCCCACTCCCTGCCGTGCTACACCCAGATCCTGTGCAGGCTGCCGGGCCCCTTCT 
GCTTCCC1TGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGATTATTTTGGTATTGCTTGACAAGGCA 
AATTCATTGTCTTGACCCAGGCGTATGACCCCTGTCGTTGAACAAGCTGTGTCTAAGATCTCTACTTTTC 
ATGAGAATCTGAGACTCTTTGGAGCCAGGCTTTCTC 

TGTATGTGAGAACTTTTGTTTGCAATATTTATTTTTGTGGGTGTCGGCTTCCTATGTGGGCTTTTTGGGT 

GACACTCCCTTAAGGGTTCAGTTTGACAATTCTGAGAGTTGTCCTGCAGTTGGAGGCCACC^GAGGTATC 

TGAGOTCCCTGCTTCCTATTTCATAATCCTCCAGCCCCAGCAGGTCCACTCCTGGTTCCTGTGTGTTTGG 

CCCGGGCACAATCCCCACTGCTTTGCTAGACGTGCTTTCTGCCATGTGGCTTTGGGCCTAGAGCTTGTTG 

ATAATTGCAGCTTGTGGCAGGGGAAATATGGCTGAATGAGCGTCTAAATCGTTGAGACCAGTGCAACTTT 

GGGTGCAAGGCTTTGTTTAGGGATCAAGCCTTTTGCCACCTTGGGCTGGTCTTTGGCCTGGTGCTCACTG 

GGACCCCATATGTCTGCGTAGGAGCAGAACTTTCCATGGCAGTAAGTGTCCAGCTCTGTTTCTGGTTCT 

TCCCCAACTCC^GCCCCGTCCAGTTGTTCTCCTGATTGACCCGACTCCACTCCAGGAAGGCCATCTGACC 
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CTGTGACAGGCATAGCTCATAAACTACCCCT 

AGCTGGGCTAACTTTCTAAGTCATTTTGCTTAGAAAT^ 

TGGCATCCAGGCAGGGACACCCTCACACCA.CCAGCCCCAGGG 

TTGTCTTTGCCTCTGATTTTTACACAGTGT^ 

ATAGATAACTTTGGGTCTGGTTTGTGTCT^ 

GGACGTGTGCTTGTGGGGCACAGGTGGCGGCCCCTGCTGGAGCCCGGCTGGGCGC^ 
CGGGTGTTCTCAGTGACCTACCTCCCAGGCTCCrC^ 

ACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACCAGGAGATGTGAATG 
GGGTTTGTCAAGGGAATCGTTACCGTTTTATACGAAAGG 
TCCAAAAACGAGTTTATATTTTCAAACGGTTTTTACAGCTTAGAC 
CAGTTGTATGCCTTCATTTTGTATCCAACAGGAAA 

Human MSTP028 mRNA sequence - var2 (public gi: 13994352) (SEQ ID NO: 91) 

GGAGACTCCTGCGTCCTCCGACTTTTCATGGAAGAGATGTCAGGAGAAAGTGTGGTGA 
CAGCGGCTGCTACCCGCACCACTTCCTTCAAGGGCACGAGCCCCAGCT 

GGGTGGAGCCCTCTACTATACCACCATGCAGACGCTGACCAAGCAGGACACCATGCTGAAGG^ 
AGCGGGCGCATGGAAGTGCTCACCGACAGTGAAGGCTGGATCCTC^ 

GTACGATACTCAACTACCTTCGAGACGGGGCGGTGCCTTTACCCGAGAGCCGCCGGGAGATCGAGGAGCT 

GCTAGCAGAAGCCAAGTACTACCTAGTCC^GGCCTGGTGGAAGAGTGCCAGGCGGCCOT 

GATACTTATGAGCCTTTCTGCAAGGTCCCTGTGATCACCT 

CTTCAAATAAGCCAGCCGTGAAGTTGCTCTACAACAGAAGTAACAACAAATACT 
TGACGACAATATGTTGAAAAACATTGAACTGTTTGATAAGCTGTC 

TTCATAAAGGATGTTATTGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTCAGGGCCGGAAGATTGCTG 
. AAGTCTGTTGTACCTCCATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAGTTTCCCGAAGCCCG 
GATTTATGAGGAGACCCTGAACATTTTGCTGTATGAGGCCCAGGATGGCCGGGGACCTGACAATGCGCTC 
CTGGAGGCCACAGGCGGGGCGGCGGGGCGCTCCCACCACCTGGACGAGGACGAGGAGCGGGAGCGGATCG 
AGCGCGTGCGGAGGATCCACATCAAGCGCCCTGATGACCG 

ACCGAGCCGGCCTCCTCTCACCGCCCCCACTCCCTGCCGTGCTACACCCAGATCCTGTGCAGGCTGCCGG 
GCCCCTTCTGCTTCCCTTGGAGCCTGGAGATACTTT^ 

GACAAGGCAAATTGATTGTCTTGACCCAGGCGTATGACCCCTGTCGTTGAACAAGCTGTGTCTAAGATCT 
CTACTTTTCATGAGAATCTGAGACTCTTTGGAG 

AGTGTGAAGTGTATGTGAGAACTTTTGTTTGCAATATTTATTTTTGTGGGTGTCGACTTCCTA 
TTTTTGGGTGACACTCCCTTAAGGGTTCAGTTTGACA^ 

AGAGGTATCTGAGCTCCCTGCTTCCTATTTCATAATCCTCCAGCCCCAGCAGGTCCACTCCTGGTTCCTG 
TGTGTTTGGCCCGGGCAGAATCCCCACTGCTTTGCTAGACGTGC 

AGC^TGTTGATAATTGGAGCTTGTGGCAGTGGAAATATGGCTGAATGAGCGTCTAAATCGTTC^ 

TGCAACTTTGGGTGCAAGGCTTTGTTTAGGGATCAAGCCTTTTGCCACCTTGGGCTGGTCTT^ 

TGCTCACTGGGACCCCATATGTCTGCGTAGGAGCAGAACTTTCCATGGCAGTAAGTGTCCAGCTCTGTTT 

CTGGTTCTTTCCCCAACTCCAGCCCCGTCCAGTTGTTCTCCTGATTGACCCGACTCCACTCCAGGAAGGC 

CATCTGACCCTGTGACAGGCATAGCTCATAAACTACCCCTCCCTGGGATCCCGCTCCTCTTCAGCCTCCT 

TCCCCATGAAGCTGGGCTAACTTTCTAAGTCATTTTGCTTAGAAATTCAGTGTGGCCCATACCCT 

CTCCCAGCCTGGCATCCAGGCAGGGACACCCTCACACC^CC^GCCCCAGGGAGCTTCCCTGCTATAAACA 

CAGACCCCCTTGTCTTTGCCTCTGATTTTTACACAGTGTAGAGTGGCCAGCAGTGAACAGGTTGAGGATG 

TGCGGGTAGATAGATAACTTTGGGTCTGGTTTGTGTCTGTGTTCATGTTTGTTTAAGGGATATGTGTGAC 

TGTGGGTGGGGACGTGTGCTTGTGGGGCACAGGTGGCGGCCCCTGCTGGAGCCCGGCTGGGCGCAGCGCC 

TATGTAGGACGGGTGTTCTGAGTGACCTACCTCCCAGGCT 

TCGTGACTGACA.GGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACGAGGAGATGTGAATGTGCGT 

GTGATGGATGGGTTTGTCAAGGGAATCGTTACCGTTTTATACGAAAGGTATTAACATGGGC^ 

CAGATGTATTCCAAAAACGAGTTTATATTTTCAAACGGTTTTTACAGCra 

TGCCTGTGACAGTTGTATGCCTTCATTTTGTATCCAACAGCAAAGTCTACAATAAAACT 

ATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human MSTP028 mRNA sequence - var3 (public gi: 25303941) (SEQ ID NO: 92) 

CCGGGTTTGGAGACTCCTGCGTCCTCCGACTTTTCATGGAAGAGATGTCAGGAGAAAGTGTGGTGAGOTC^ 
AGCGGTGCCAGCGGCTGCTACCCGCACCACTTCCT 

CTGAATGTGGGTGGAGCCCTCTACTATACCACCATGCAGACGCTGACCAAGCAGGACACCATGCTGAAGG 
CCATGTTCAGCGGGCGCATGGAAGTGCTCACCGACAGTGAAGGCTGGATCCTCATTGACCGCTGTGGGAA 
GCACTTTGGTACGATACTCAACTACCTTCGAGACGGGGCGGTGCCTTTACCCGAGAGCCGCCGGGAGATC 
GAGGAGCTGCTAGCAGAAGCCAAGTACTACCTAGTCCAAGGCCTGGTGGAAGAGTGCCAGGCGGCCCTAC 
AAAACAAAGATACTTATGAGCCTTTCTGCAAGGTCCCTGTGAT 

TATAGCGACTTCAAATAAGCCAGCCGTGAAGTTGCTCTACAACAGAAGTAACAACAAATACTCATATACC 
AGCAATTCTGACGACAATATGTTGAAAAACATTGAACTGTTTGATAAGCTGTOTCTGCGCTTTAACGG^ 
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GGGTCCTGTTC^TAAAGGATGTCA^ 

GATTGCTGAAGTCTGTTGTACCTCCATCG 

GAAGCCCGGATTTATGAGGAGACCCT6AACATTTTGCTC 

ATGCGCTCCTGGAGGCCAGAGGCGGGGCGGCGGGGCGCTCCCACC^ 

GCGGATCGAGCGCGTGCGGAGGATCGAGATC^ 

GGCAAGAGACCGAGCCGCCCTCCTCT(^CCGCCCCCACT^ 

GCTGCCGGGCCCCTTCTGCTTCCCTTGGAGCCTGGAGATACTTTTGTAACAAG 

TATTGCTTGACAAGGCAAATTGATTGTCTTGA 

TAAGATCTCTACTTTTCATGAGAATCTGAGACT 

TATGAATGAGTGTGAAGTGTATGTGAGAACTTTTC 

ATGTGGGCTTTTTGGGTGACACTCCCTTAAGGGTTC^ 

AGGCCACCAGAGGTATCTGAGCTCCCTOCTTCCTATTTC^TAATCCT 

GGTTCCTGTGTGTTTGGCCCGGGCACAATCCCCACTGCTTTGCTAGACGTGC 

GGGCCTAGAGCTTGTTGATAATTGC^GCTTGTGGCAGTGGAAATATGGCTGAA^ 

GAGACCAGTGCAACTTTGGGTGCAAGGCTTTGTTTAGGGATCAAGCCT 

TGGCCTGGTGOTCACTGGGACCCCATATGTCTGCGTAGGAGCAGAACTTTCCATGGCAGTAAGTG 

CTCTGTTTCTGGTTCTTTCCCCAACTCCAGCCCCGTCCAGTTGTTCTCCTGATTGACCCGA 

AGGAAGGCCATCTGACCCTGTGACAGGCATAGCTCATAAACTACCCOT 

AGCCTCCTTCCCCATGAAGCTGGGCTAACTTTCTAAGTCATTTTGCTTAGAAATTCAGTG 

CCTTTGTCCTCCCAGCCTGGCATCCAGGGAGGGACACCCTCACA^ 

TATAAACACAGACCCCCTTGTCITTGCCrCTGAOTTTTTACAC^ 

TGAGGATGTGCGGGTAGATAGATAACTTTGGGTCTGGTTTGTGTCTGTGTTCATGTTTGTTTAAGGGATG 
TGTGACTGTGGGTGGGGACGTGTGCTTGTGGGGCACAGGTGGCCCCTGCTGGAGCCCGGCTGGGCGCAGC 
GCCTATGTAGGACGGGTGTTCTCAGTGACCTACCTCCCAGGCTCCTCTGCACCTGCAAAGGAACAGGAGT 
GAGTCGTGACTGACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACCAGGAGATGTGAATGTGCGT 
CAGGTGATGGATGGGTTTGTCIAAGGGAATCGTTACCGTTTTATACCAAAGGTATTAAGATGGGCAGCCTT 
TGACACATGTATTCCAAAAACGAGTTTATATTTTG&AAC 

CCCTGCCTGTGACAGTTGTATGCCTTCATTTTGTATCCAACAGCAAAGTCTACAATAAAACTTTAAAAC^ 
ATCATGACTGAATGTCAAAATC^TGTATTGGGCAGATGCTTTTTAAACTGTCGTGTGAGAAACTTTTATA 
TTAGGCCATTTGGATTTTATTAAGTGCTAAGGAAAGAGGGCTTACAAAATGTTTCGTAAATATTTTATAC 
TGTTTAAGTGTTAAAGACCAACCCTGTCTTTCTTTTGGGTTGAGCTTTTTTAGAAAG 

TGGAGTGTAGTGGCACCATCTCCACTTACCACAACTTGTGCCTCCTGGGTTCAAGCGATTCTGCTGCCTC 
AGCCTCCCGAGTAGOTGGGATTGCAGGTACC<^TCAGCCCATGCC(^GCTAATTTTGTATTTTTAGTAGA 
GATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCCTGTGATCCGACCACCTTGGCCTC 
CCAAAGTGCTGGGATTACAGGTGTGAGTCACCACACCTGGCTC 

AATGAGTTTGTGGAACAATTTGATTTGCTGTGGCCTCTATGCCTAATGAGCTAGTGTTTCTGGC^GCTCT 
CTCTACCCAACTTTGCACTTGTAGTTTTGAGTCTTTGTCTCTCTGGAATATGAACAGGTTTATAAAACAT 
TCCATGGTGAAGAAOTCTGTCGGCTGCATTATAGCCATGAGTG 

CTGTTATTGAGTATACAAGGAACTGATTTTTCTTATGTTAGCACTAAGGGCAAAAACCAATATTTATAAT 

GTAAGCACTAT CCAGGTAAAACACTGGC CCAAGATTTGGTAAAGAGATTTCATTGCAATGTAATAACTAC 

AGTTTTTTACAAATTGGAACAGCTTTGGTGTGTCGTAATCAAGGGTTTTTTTTGTTTGTTTGT^ 

AAGCCATCTGATTGTGGTGACTGGGGCCCATGTCCAAGACAATTCCTGGCATATTCTGTCACCCTCCCGT 

GGGGCGATCACTGTGTGGGGACCCCATTCCCCAGTTAAAGTGTGTCTCTGTACCTTACAACAGCGATTCA 

GGACCCAAGTGTGAACAACACTCAGCCCGCCCTCTGGAGCGTGTGCTGTCTTTAGGGCTCTACCCAAAGT 

CACTGTAACAGTTAAGTGTGTCATTAACCTTTCTGTCTCTTTGCGCCATAAAAAAATGCTCAAAGTTTTA 

GATGTAGCCACTGTATGTTGTACA7VACGTTGGCGACATGTAAAATAAAAGTCATAAAATGCAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human MSTP028 tnRNA-sequence - var4 (pubHc gi: 16552440) (SEQ ID NO: 93) 

AGTCCGGGTTTGGAGACTCCTGCGTCCTCCGACTTTTCATGGGCCCTGACATGGCAGGTGATATCCAGGA 
CACTGTTGGGTGCCATGGAGTTGGGGAGAGTTGGCCAGAAGAGTTGGATAACCTTGAATTGAATATTGTC 
CATGACTGTCGCGTTGCTTCTGCTGTTGCAAGCTGCCTCCCTTTACTCCCAGTCCCATTTACAAAAATAA 
CGCTTGTGTTTTACCAGTTATAGTTGTAGTACCCATTC^TTATAGAAAATCTGGAAAAGCTAGACAATTC 
TTTTTCAGTTT<^GGGAATAGTTCAAACAAGTTATGTC 

AGGATACCTGTAGTCAAGCAAAGTTGGGTTTATTTCCTTGTTGCATTGGGGTGGGGAAGAACTGTGGGAC 

ATCTCAGAGAAGGGCTGTGGGCTTGTGTTGGGTGATTTGAGAGACAGTTCAGAGAAGTGGGGCTTTGCTC 

TGTGTTGGATGCTGCTGGGAAGCAGGGCTAATTCTGTGATTGK3GTCTCAGTGATTCCTGACTTGAAAGCA 

GGAAGAATGGAAGGAGGCTAAACTTCTGATTGGTAAAGCAGCAGCTGTAACTCCTATTAGCCAG 

GGATCTTTGGTCATTTTTGTATTTTGGATAATGTTTATGTTTTTGTCTGTGTCCGGACATGATGACTGAA 

TGGTCCTGTTTTTGTCTTGTCGCAAGGGCACAGAGTGGCCTTGTCTGAGGGTGATGTGCTGTGAAAAACT 

GTTGATGTTCAATGGGAATGGTAGGGCCAGCCGTGGGGGCTACCCCAGATTCAGCAAAGATTCTGCCCAC 

CCTTGCACATTTCCACCTCTAGAGTTTTACCTC 
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CGGATACCAATCTCACTTTCCM^CCTC 

AGTCAGCCATGCTATCAGGGAGATGGTAGTGGGATCTTGCTCTTTTGGGCAGCA 
AATTTTGCAATAACTTGGTTCCAAAAGTTTCCATGTCT 

CAGGAAGTGATAGGAGTGCGAGCTGGAATCCCATTCAACTTCATAAAGCTm 

TGAAAAATGACACTTAGCTAGCTATTGAGTGGTAC^TGGCAATAAGGAAA 

GCTTTAGGCTGTTTTAGGGTGCAGCCAGGGTGTTCATGTATACAGG 

TAACACAAGAGTTAGGGGGACCCTTGTGCCTGCAGGGTCGACAGGCA 

GGTGGGTCTTGGGACAAACTAGGGGATGCATGGCCCTCTCTAGG 

AGTTGTTCCCCTGCTAGCCCAGTTGGCOTCTGATTTTAGGAGAAGC 

CTCTCCTTAGTTGACCACATTGGAAGGAAACTTT^^ 

-CTGGAGGCCAGATTGAATCCACAGGCTGAAAGCAGTCAACCAGGCOT 

CCACTGGCAGGAAGAGATGTC^GGAGAAAGTGTGGTGAGCTCAGCGGTGCCA 

ACTTCCTTCAAGGGCACGAGCCCCAGCTCCAAATACGTGAAGCTGAA.TGTGG 

CCACCATGCAGACGCTGACCAAGCAGGACACCATGCTGAAGGCCATO 

CACCGACAGTGAAGGCTGGATCCTCATTGACCGCTGT^ 

CGAGACGGGGCGGTGCCTTTACCCX^GAGCCXXZCGGGAGATCGAGGAGCTGCTAGC^GAAGCCAAGTACT 

ACCTAGTCCAAGGCCTGGTGGAAGAGTGCGAGGCGGCCCTACAACAGAACAAAGATACTTATGA 

CTGCAAGGTCCCTGTGATCACCTCATCGAAG 

GTGAAGTTGCTCTAC^CAGAAGTAACAACAAATACTCATATACC^GCAATTCTGACGACAATATGTTGA 

AAAACATTGAACTGTTTGATAAGCTGTCTCTGCGCTTTAACGGAAGGGTCCTGTTCATAAAGGA 

TGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTCAGGGCCGGAAGATTGCTGAAGTCTGTTGTACCTCC 

ATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAGTTTCCCGAAGCCCGGATTTATGAGGAGACCC 

TGAACATTTTGCTGTATGAGGCCCAGGATGGCCGG 

GGCGGCGGGGCGCTCCCACCACCTGGACGAGGACGAGGAGCGGGAGCGGATCGAGCGCGTGCGGAGGATC 
CACATCAAGCGCCCTGATGACCGGGCCGACCTCCACCAGTGAGCAGGC^GAGACCGAGCCGCCCTCCTC 
TCACCGCCCCC^CTCCCTGCCGTGCTACACCCAGATCCTGTGCAGGCTGCCGGGCCCCTTCTGCTTCCCT 
TGGAGCCTGGAGATACTTTTGTAACAAGCCAGATGATTATTTTGGTATTGCTTGACAAGGCAAATTGATT 
GTCTTGACCCAGGCGTATGACCCCTGTCGTTGAACAAGCTGTC 

CTGAGACTCTTTGGAGCCAGGCTTTCTCGGTTCTCAGAGGAAAAGTATGAATGAGTGTGAAGTGTATGTG 

Human MSTP028 mRNA sequence - var5 (pubHcgi:2i750697)(SEQIDNO:94) 

GCTGGCGTGAGCTGGGTGTTTCCTGCCTCTCTCAGTCCGGGTTTGGAGACTCCTGCGTCCTCCGACTTTT 

CATGGAAGAGATGTCAGGAGAAAGTGTGGTGAGCTCAGCGGTGCCAGCGGCTGCTACCCGCACCACTTCC 

TTCAAGGGCACGAGCCCCAGCTCCAAATACGTGAAGCTGAATGTGGGTGGAGCCCTCTACTATACCACCA 

TGCAGACGCTGACCAAGCAGGACACCATGCTGAAGGCCATGTCCAGC^ 

(^GTGAAGAACAAAGATACTTATGAGCCTTTCTGG^GGTCC^ 

AAAACTTATAGCGACTTCAAATAAGCCAGCCGTGAAGTTGCTOTACAACAGAAGTAACAACA 

TATACCAGCGATTCTGACGACAATATGTTGAAAAACATTGAACTGTTTGATAAGCTGTCTCTGCGCTTTA 

ACGGAAGGGTCCTGTTCATAAAGGATGTTATTGGGGATGAAATCTGCTGCTGGTCCTTTTATGGTCAGGG 

CCGGAAGATTGCTGAAGTCTGTTGTACCTCCATCGTCTATGCCACTGAGAAGAAACAGACCAAGGTGGAG 

TTTCCCGAAGCCCGGATTTATGAGGAGACCCTGAACATTTTGQTGTATGAGGCCCAGGGTGGCCGGGGAC 

CTGACAATGCGCTCCTGGAGGCCACAGGCGGGGCGGCGGGGCGCTCCCACCACCTGGACGAGGACGAGGA 

GCGGGAGCGGATCGAGCGCGTGCGGAGGATCCACATCT^AGCGCCCTGATGACCGGGCCCACCTCCACCAG 

TGAGCAGGCAAGAGACCGAGCCGCCCTCCTCTC^VCCGCCCCCACTCCCTGCCGTGCTACACCCAGATCCT 

GTGCAGGCTGCCGGGCCCCTTCTGCTTCCCTTG 

TTTTGGTATTGCTTGACAAGGCAAATTGATTGTCTTGAC^ 

TGTGTCTAAGATCTCTACTTTTCATGAGAATCTGAGA 

GAAAAGTATGAATGAGTGTGAAGTGTATGTGAGAACTTTTGTTTGCAATATTTATTTTTGTGGGTGTCGA 
CTTCCTGTGTGGGCTTTTTGGGTGACACTCCCTTAAGGGTT^ 

AGtTGGAGGCCACCAGAGGTATCTGAGCTCCCTGCTTCCTATTTCATAATCCTCCAGCCCCAGCAGGTCC 
ACTCCTGGTTCCTGTGTGTTTGGCCCGGGCACAATCCCC^ 

GGCTTTGGGCCTAGAGCTTGTTGATAATTGCAGCTTGTGGCAGTGGAAATATGGCTGAATGAGCGTCTAA 
ATCGTTGAGACC^GTGCAACTTTGGGTGCAAGGCTTTGTTT^ 

GGTCTTTGGCCTGGTGCTCACTGGGACCCCATATGTCTGCGTAGGAGCAGAACTTTCCATGGCAGTAAGT 
GTCCAGCTCTGTTTCTGGTTCTTTCCCCAACT 

CACTCCAGGAAGGCCATCTGACCCTGTGACAGGCATAGCTCATAAACTACCCCTCCCTGGGATCCCGCTC 

CTCTTCAGCCTCCTTCCCCATGAAGCTGGGCTAACTTTCTAAGTCATTTTGCTTAGAAATTCAGTC 

CC^TACCCTTTGTCCTCCC^GCCTGGCATCC^^ 

CCCTGCTATAAACACAGACCCCCTTGTOTTTGCCTCTGATTTTTACACAGTGTAGAGTGGCCAGCAGTGA 
AGAGGTTGAGGATGTGCGGGTAGATAGATAACTTTGGGTCTGGTTTGTGTCTGTGTTCATGTTTGTTTAA' 
GGGATATGTGTGACTGTGGGTGGGGACGTGTGCTTGTGGGGCACAGGTGGCGGCCCCTGCTGGAGCCTGG 
CTGGGCGCAGCGCCTATGTAGGAOSGGTGTTCTCAGTGACCTACCTCCa^GGCTCCTCTGCACCTGCAAA 
GGAACAGGAGTGAGTCGTGACTGACAGGGGTGGTTGAGACTAGACTAGGTAGAGTAGTTACCAGGAGATG 
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TGAATGTGCGTCAGGTGATGGATGGGT 

TGGGCAGCCTTTGACACATGTATTCCAAAAACGAGTTTATATTTT 

TGTACTTACTGCC CTGCCTGTGACAGTTGTATGCCTTCATTTTGT^^ CAAAGTCTACAAT AAA 

ACTTTAAAACAATCATG 

Human MSTP028 Protein sequence - varl -(public gi: 13994353) (SEQ ID NO: 255) 

meemsgeswssavpaaatrttsfkgtspsskyvklnvggalyyt™ 

segwilidrcgkhfgtilnyi*rdgavplpesi^eieellaeak^ 

•pvitsskeeqkliatsnkpavkxiiiynrsnnkysytsnsddnmijkni elifdklslrfngrvlfi kdvigde 

I CCWS FYGQGRKI AEVCCTS I VYATE KKQTKVE FPEARI YEETLNI LL YEAQDGRGPDNALLEATGGAAG 
RSHHIiDEDEERERIERVRRIHIKRPDDRAHXiHO 

Human MSTP028 Protein sequence - var2 (public gi: 14042295) (SEQ ID NO: 256) 

MSGES WSS AVPAAATRTTS FKGTS PS SKYVKLNVGGALYYTTMQTLTKQDTMLKAMFSGRMEVLTDSEG 

WILIDRCGKHFGTILNYIjRDGAVPLPESRREIEELIiAEAKYYLVQGLV^ 

TSSKEEQKIjIATSNKPAVKIiLYimSNNKYSYTSNSDDNMLKN 

WSFYGQGRKI AEVCCTS I VYATE KKQTKVE FPEARI YEETLNI LLYEAQDGRGPDNALLEATGGAAGRSH 
HLDEDEERERIERVRRIHIKRPDDRAHLHQ 

Pnicjene Name: PACS-1 Unigene ID; Hs. 58589 

Human PACS-1 mRNA sequence - varl (public gi: 27781345) (SEQ ID NO: 95) 

AGC^CGAGTCTGGTTGTGCCGGAGAAAGTC^WVACTCCC^^ 

CTGCCTCCCCCAGCAAAGTGGAGGGGGTGCACAC^CCCCGGCAGAAGAGGAGCACGCCCCTGAAGGAGCG 
GCAGCTCTCCAAGCCCCT^AAGTGAGAGGACCAAGAGTTCCGACAGCGAGCGCT 

AGCACGCAGATTCCAAGAAAGGTGGTGTATGACCAGCTCAATCAGATCCTGGTGTCAGATGCAGCCCTCC 
CAGAAAATGTCATTCTGGTGAACACCACTGACTGGCAGGGCCAGTATGTGGCTGAGCTGCTCCAGGACCA 
GCGGAAGCCTGTGGTGTGCACCTGCTCCACCGTGGAGGTCC^GG 

ATCCAGCGCTACTGCAACTGCAACTCTTCCATGCCGAGGCCAGTGAAGGTGGCTGCTGTG 

GCTACCTGAGCTCCATCCTCAGGTTCTTTGT(^AGTCCCTGGCCAACAAGACCTCCGACTGGCTTGGCTA 

C^TGCGCTTCCTCATC^TCCCCCTCGGTTCTCACCCTGTGGCCAAATACTTGGGGTC^TCGA<^GTAA^ 

TAC^GTAGTTCCTTCCTGGATTCTGGTTGGAGAGATCTGTT(^GTCGCTCGGAGCC^CCAGTGTC^GAGC 

AACTGGACGTGGCAGGGCGGGTGATGCAGTACGTGAACGGGGGAGCCACGA 

CGAAGCCATGCTGACTTGCCGGCATAAGTTCCCTGATGAAGACTCCTATCAGAAGTTTATTCCCT 

GGCGTGGTGAAGGTGGGTC^GGTTGAAGACTCTCCCTCCACAGC^GGCGATGGGGACGATTCTCCTGTG^ 

TCAGCCTTACTGTGCCCTCGACATCACCACCCTCCAGCT 

TCCCTCCTCCCCATCTATGAGCAGCGCCCTGGCCATCGTGGGGAGCCCTAATAGCCCATATGGGGACGTG 

ATTGGCCTCCAGGTGGACTACTGGCTGGGCCACCCCGGGGAGCGGAGGAGGGAAGGCGACAAGAGGGACG 

CCAGCTCGAAGAACACCCTC^AGAGTGTCTTCCGCTCAGTGCAGGTGTCCCGCCTGCCCCATAGTGGGGA 

GGCCCAGCTTTCTGGCACCATGGCCATGACTGTGGTC^CCAAAGAAAAGAACAAGAAAGTTCCCACCATC 

TTCCTGAGCAAGAAACCCCGAGAAAAGGAGGTGGATTCTAAGAGCCAGGTCATTGAAGGCATCAGCCGCC 

TCATCTGCTCAGCCAAGCAGCAGCAGACTATGCTGAGAGTGTCCATCGATGGGGTCGAGTGGAGTGACAT 

CAAGTTCTTCCAGCTGGCAGCCGAGTGGCCCACCCATGTCAAGCACTTTCCAGTGGGACTCTT(^ 

AGCAAGGCCACCTGAGGCCCTGTCTCCCAGCC^CTTTCCCTCCTGGCACTGCCACCAGCCT 

CGGGGA.GGGGGAGGCCAGCAGGCCCGGGCCCAGCACCCCTTCCCTGGCACCAGGGTCTGCCTOT 

CCCAGGTCCCGAAGGAC^CTGCCACAGGGACGCCTTCCCTCCCCTCCCCTCCAGCCCACCCCTGCACAGC 

CCCTCCTCCTTCCCGCTTTTCCCCTTCTCCCTCCTGCTC 

GTGCCCATAGTCCCCTGGACTGAGTCCCCCAGGCCTTCCTTCACCCGACTTCCAAACTCTTCCTTGTGGT 
ATCAGTTTCCTTCTCGGAAATGAGAAAGCTGGAATCCTGGTCCCCAGCAGGAGAGCCTAGTCCTCCCCCA 
GCCCCTCCAGCCACGAGGGTGTCCTCTAGGATGCAGCTGCCAGATCCACTCACT 
GACCCAAGGCCACTTTCAACTCTTATGGGGTTCTCCACCTGCCCCAGAGCTTCT 

GGCACCCTGAGCCCACAGGACCCCTACTTG^CAGCTCACAGGGGCAGGAGGCAGCTCCCCTGCCTCCAGG 

ACCCTGTTGCTATGGTGAC^CAGCGTTTCTAGGAGAGAGGGGCCTCCCAGTCTCCCCCCACCACCCGTGC 

ACGACTTCCTCACCACCCCCAGGTTCCCTGCAGATGTCGTGTGTGTCCTGAGTGTTTCTTTGGTTCTTTG 

C^CGCCAAGTCTCTTGGTTGTACCATGTGACAC^CCCTGTGCACTGGTCGCTGTCTTCGTGGCT 

CTTGTTAATGATGCTCCTGCCTCTGCCTCCCAGCCCCTCACCCAGCACAGOTCTGCCTGGACTTGGAGAG 

ATGGGAGGGAGACCCCCACCACGATAC^TGCTGTCTGTGGCCCCTCAGACAT^ 

CATCTCCCTCCTCCCACCGTGTCAGTTTTTCTGCCTTTCCCTGCTCTGT^ 

AGCCTGGGCCCAGACCCATCCTCCC^GCCAGGTTTCCCTCGA^ 

CTCTCACCAACCCGGGGTCTGAGCCCCTCATTCCTGACCG 

<3GGCCC(^GCC<^GCCCTCTGCACCCCCC^G^ 

•TCTTGTCCTCTCACTCCTTCCCAGAAGTTTTTGCACAGAACTTCATTTTGAAAGTGTTTTTCTCATTC 
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CATACCTCCCCCAAGCTCTCCTCCAGC 

CAGAGAGAGGAGATGGGGGTGGGAGGGACTGAGTTGATGTT^ 
CTTAAAAAAAAAAAAAAAAAA 

Human PACS-1 raRNA sequence- var2 (public gi: 30962845) (SEQ ID NO: 96) 

CCTCGGCCTCCGTAACCCCCGCCTAGCCGGGCCATG 

GCGCCGGGGGCX^GGAGCGGCCAGCGGGGATCCGGGGTCGCCC^ 

G<^GCAGCAGCAGCCGCCGCAGCAGCCGACGCCCCCC7UVGCTGGCCCAG^ 

ACCTCGGCGGCGGCTGCOTCCTCCTCX3TCCTCGTCTACCTCCACCTCCATGG 

GCTCCGCGCCTCCCGGTGGCCCGGGGCCAGGCCGCACCCCCGCCCCGGTGCAG 

CTGGGAGGTGGACCGGAGCTCGTCCAGCTGCGTGCCTAGGCTATTCAGCTTC 

ATGCTAAAAGAAATGGAGAAAGATCTTAACTCAGTGGT 

TTCrrCGCTCCAACGAGATCGTCCTTCGAGCTAGTGGACTGGTGGAAACAGAGCT 

CCTTCAGTACCCTCATTTCCTTAAGCGAGATGCGAACAAGCTGCAGATCATGCTGCAA 

TACAAGAATCGGACCATCTTGGGCTATAAGACCTTG 

AGCATCCTAATGAAGGCGCACTGGTGCTTGGCCTACACAGCAACGTGAAGGATG 

AGAAATAAAGATCTACTCCCTGTCCAGCCAACCCATTGACCATGAAGGAATCAAATCCAAGCTTTCT^ 

CGTTCTCCTGATATTGACAATTATTCTGAGGAAGAGGAAGAGAGTTTCTCATCAGAACA 

ATGATCGATTGGATGGGCAGGACTTGTTCTACGAAGACGAAG 

GAAACTAACCTCAACCTCTGCCATGACAAGGCAACCTAACATCAAACA^ 

CGGTTTAAAGTTTCAGATGAGGTGGGCTTTGGGCTGGAGCATC 

AAGAGGACTTGGATGAATTGTATGACAGTCTGGAGATGTACAACCCCAGCGACAGTGGCCCTGAGATGGA 
GGAGACAGAAAGCATCCTCAGCACGCCAAAGCCCAAGCTCAAGCCTTTCT 

AGCTCC(^GACGGAGATTGGCAGCCTCAACAGC^U^GG<^GCCTCGGAAAAGACACCACCAGCCCTATG^ 
AATTGGCTGCTCTAGAAAAAATTAAATCTACTTGGATTAAAAACCAAGATGACAGCTTGACTGAAACAGA 
CACTCTGGAAATCACTGACCAGGACATGTTTGGAGATGCCAGC^ 
AAAACTCCCATGAAGTCCAGTAAAACGGATCTCCAGGGCTCTGCCT 

AC^CACCCCGGCAGAAGAGGAGCACGCCCCTGAAGGAGCGGCAGCTCTCCAAGCCCCTAAGTGAGAGGAC 

CAA(^GTTCCGACAGCGAGCGCTCCCCAGATCTGGGCCAmGCACGCAGATTCCAAGAAAGGTGGTGTAT 

GACCAGCTCAATCAGATCCTGGTGTCAGATGCAGCCCTCCtoGAAAATGTCATTCTGGTGAACACCACTG 

ACTGGCAGGGCCAGTATGTGGCTGAGCTGCTCCAGGACCAGCGGAAGCCTGTGGTGTGCACCTGCTCCAC 

CGTGGAGGTCCAGGCCGTGCTGTCCGCCCTGCTCACCCGGATCCAGCGCTACTGCAACTGCAACTCT^ 

ATGCCGAGGCGAGTG7VAGGTGGCTGCTGTGGGAGGCCAGAGCTACCTGAGCTCCATCCTCAGGTTCTTTG 

TCAAGTCCCTGGCCAACAAGACCTCCGACTGGCTTGGCTACATGCGCT^ 

TCACCCTGTGGCCAAATACTTGGGGTCAGTCGACAGTAAATACAGT 

AGAGATCTGTTCAGTCGCTCGGAGCCACCAGTGTCAGAGGAACTGGACGTGGCAGGGCGGGTGATGCAGT 
ACGTC^CGGGG<^GCCACGA<^CACCAGCTTCCCGTGGCCGAAGCCATGCTGACOTGCCGGCATAAGTT 
CCCTGATGAAGACTCCTATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGTGGGTCTGGTTGT^AGAC 
TCTCCCTCCACAGCAGGCGATGGGGACGATTCTCCTGTGGTCAGCCT^ 

CCTCC^GCTCGGGCCTGAGCCGAGACGCC^CGGCCACCCCTCCCTCCTCCCCATCTATGAGCAGCGCCCT 
GGCCATCGTGGGGAGCCCTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGTGGACTACTGGCTGGGC 
CACCCCGGGGAGCGGAGGAGGGAAGGCGACT^GAGGGACGCCAGCTCGAAGAACACCCTCAAGAGTGTCT 
TCCGCTCAGTGCAGGTGTCCCGCCTGCCCGATAGTGGGGAGGCCCAGCTTTCTGGCACCATGGCCATGAC 
TGTGGTCACCAAAGAAAAGAACAAGAAAGTTCCCACCATCTTCCTGAGCAAGAAACCCCGAGAAAAGGAG 
GTGGATTCTAAGAGCGAGGTCATTGAAGGCATCAGCCGCCTCATCTC 

TGCTGAGAGTGTCCATCGATGGGGTCGAGTGGAGTGACATCAAGTTCTTCCAGCTGGCAGCCCAGTGGCC 

CACCCATGTCAAGGACTTTCCAGTGGGACTCTTCAGTGGCAGCAAGGCCACCTGAC^ 

CCACTTTCCCTCCTGGCACTGCCACCAGCCTCACCGCCTGCGGGCAGGGGGAGGCCAGCAGGCCCGGGCC 

(^GCACCCOTTCCCTGGCACCAGGGTCTGCCTCTCACTCGCCCAGGTCCCGAAGGAC^ 

CGCCTTCCCTCCCCTCCCCTCCAGCCCACCCCTGCACAGCCC^ 

OTCCTGCTCCAGGCCCAAGGCGTGTTGGTTTTGCCTTCTGGTGCCCATAGTCGCCTGGACTGAGTCCCCC 

AGGCCTTCCTTCACCCGACTTCCAAACTCTTCCTTGTGGTATCAGTTTCCTTCTCGGAAATGAGAAAGCT 

GGAATCCTGGTCCCCAGCAGGAGAGCCTAGTCCTCCCCC^GCCCCTCGAGCCACGAGGGTGTCCTCT 

ATGCAGCTGCCAGATCCACTCACTCTGCTGCCTCCAGCAGGACCCAAGGCCACTTTC^CTCTTATGG^ 

TTCTCCACCTGCCCCAGAGCTTCTCAAGGGAGGGTAAGGGGGCACCCTGAGCCCACAGGACCCCTACTTC 

ACAGCTCACAGGGGCAGGAGGCAGCTCCCCTGCCTCGAGGACCCTGTTGCTATGGTGACACAGCGTTTCT 

AGGACAGAGGGGCCTCCCAGTCTCCCCCCACCACCCGTGCACGACTTCCTCACCACCCCCAGGTTCCCTG 

CAGATGTdGTGTGTGTCCTGAGTGTTTCTTTGGTTCTTTGCACGCC^GTCTCTTGGTTGTACCATGTGA 

CACACCCTGTGCACTGGTCGCTGTCTTCGTGGCTTCGACCCTTGTTAATGATGCTCCTGCCTCTGCCTCC 

CAGCCCCTCACCCAGCACAGCTCTGCCTGGACTTGGAGAGATGGGAGGCAGACCCCCACCACCATACATG 

CTGTCTGTGGCCCCTCAGACATTCTGTTTCATCTCC<^TTC^TCTCCCTCCTCCCACCGTGT<^GTTTTO 

CTGCCTTTCCCTGCTCTGTTCTTCCCCCTCCTT^^ 

GGTTTCCCTCC^GCAGGCTCCTTCCCTCCCTGTCACCTCCCTCTCACC^CCCGGGGTCTGAGCCCCTCA 
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TTCCTCACCGTCCGTGTTCT^ 

GCCCGGCCATCTGCGCCCCACAGCCCCTTTGGAGCTTTTCTCTTGTCCTCT 
TTGCACAGAACTTCATTTTGAAAGTGTTTTTCTCA 

TCCCAGGGCTCAGCCCTG CTGTC CTGAGCX3TCTCCTGGGCCAGAGAGAGGAGATGGGGGTGGGAGGGACT 

GAGTTGATGTTGGGTTTTTCATTCAATAAATTGGTGATTTCT 

AAAAAAAAAAAAAAAA 

Human PACS-1 niRNA sequence - var3 (public gi: 33243994) (SEQ ID NO: 97) 

CAGAAAGCATCCTC^GCACGCCAAAGCCC 

CCAGACGGAGATTGGCAGCCTCAACAGjCAAATC 

GCTGCTCTAGAAAAAATTAAATCTACTTGGATTAAAAACCAA 

TGGAAATCACTGACCAGGACATGTTTGGAGATGCCAGCACGAGTCTGGT 

TCCCATGAAGTCCAGTAAAACGGATCTCC^GGGCrCTGCCTCCCCCA 

CCCCGGC^GAAGAGGAGCACGCCCCTGAAGGAGCGGCAGCTCTCCAAGCCCCT 

GTTCCGACAGCGAGCGCTCCCCAGATCTGGGCCACAGCACGCAGATTCCAAGAAAGGTGGTGTATGACCA 

GCTCAATCAGATCCTGGTGTCAGATGCAGCCCTCCCAGAAAATGTCATTCTGGTGAACACCACTGACT 

CAGGGCCAGTATGTGGCTGAGCTGCTCCAGGACCAGCGGAAGCCTGTGGTGTGCACCTGCTCCACCGTGG 

AGGTCGAGGCCGTGCTGTCCGCCCTGCTCACCCGGATCCAGCGCTA^ 

GAGGCGAGTGAAGGTGGCTGCTGTGGGAGGCCAGAGCTACCTGAGCTCC^^ 

TCCCTGGCCAACAAGACCTCCGACTGGCTTGGCTACATGCGCTTCCTC^TCATCCCCCTCGGTTCTCACC 
CTGTGGCCAAATACTTGGGGTCAGTCGACAGTAAATACAGTAGTTCCTTCCTGGATTCTGGTTGGAGAGA 
TCTGTTCAGTCGCTCGGAGCCACCAGTGTCAGAGC^ 
AACGGGGGAGCCACGAGACACCAGCTTCCCGTGGCCGAAGCCATC 

ATGAAGACTCCTATC^GAAGTTTATTCCCTTCATTGGCGTGGTGAAGGTGGGTCTGGTTGAAGACTCTCC 
CTCCACAGCAGGCGATGGGGACGATTCTCCTGTGGTCAGCCTTACTC 

AGCTCXK^CCTGAGCCGAGACGCCACGGCCACCCCTCCCTCCTCCCCATCTATGAGCAGCGCCCTGGCG 

TCGTGGGGAGCCCTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGTGGACTACTGGCTGGGCCACCC 

CGGGGAGCGGAGGAGGGAAGGCGACAAGAGGGACGCCAGCTCGAAGAACACCCTCAAGAGTGTCTTCCGC 

TCAGTGCAGGTGTCCCGCCTGCCCCATAGTGGGGAGGCCCAGCTT^ 

TCACCAAAGAAAAGAACAAGAAAGTTCCCACCATCTTCCTGAGC^ 

TTCTAAGAGCCAGGTCATTGAAGGC^TC^GCCGCCTCATCTGCTCAGCCAAGCAGCAGCAGACTATGCTG 
AGAGTGTCCATCGATGGGGTCGAGTGGAGTGACATCAAGTTCTTCGAGCTGGCAGCCCAGTGGCCCACCC 
ATGTCAAGC^CTTTCCAGTGGGACTCl^CAGTGGGAGCAAGGCGACCT^ 

TTCCCTCCTGGCACTGCCACCAGCCTCACCGCCTGCGGGCAGGGGGAGGCCAGCAGGCCCGGGCCCAGCA 
CCCCTTCCCTGGC&CCAGGGTCTGCCTCTCACTCGCCCAGGTC 

TCCCTCCCCTCCCCTCCAGCCCACC.CCrGC^C^GCCCCTCCrCCTTCCCGOTTTTCCCCTTCT 

GCTCCAGGCCCAAGGCGTGTTGGTTTTGCCTTCTGGTGCCCATAGTCCCCTGGACTGAGTCCCCCAGGCC 

TTCCTTCACCCGACTTCCAAACTCTTCCTTGTGGTATC^ 

CCTGGTCCCCAGCAGGAGAGCCTAGTGCTCCCCCAGCCCCTCCAGCCACCAGGGTGTCCTCTAGGATGC^ 
GCTGCCAGATCCACTCACTCTGCTGCCTCCAGCAGGACCCAAGGCCACTTTCAACTCTTATGGGGTTCTC 
CACCTGCCCCAGAGCTTCCCAAGGGAGGGTAAGGGGGCACCCTGAGCCCACAGGACCCCTACTTCACAGC 
TC7VCAGGGGCAGGAGGCAGCTCCCCTGCCTCCAGGACCCTGTTGCTATGGTGACACAGCGTTTCTAGGAC 
AGAGGGGCCTCCCAGTCTCCCCCCACCACCCGTGCACGACXTCCTCACCACCCCCAGGTTCCCTGCAGAT 
GTCGTGTGTGTCCTGAGTGTTTCTTTGGTTCTTTGCACGCCAAGTCTCTTGGTTGTACCATGTGACACAC 
CCTGTGCACTGGTCGCTGTCTTCGTGGCTTCCACCCTTGTTAATGATGCTCCTGCCTCTGCCTCCCAGCC 
CCTCACCCAGCACAGCTCTGCCTGGACTTGGAGAGATGGGAGGCAGACCCCCACCACCATACATGCTGTC 
TGTGGCCCCTCAGACATTCTGTTrCATCTCCC^TTCATCT 

TTTCCCTGCTCTGTTCTTCCCCCTCCTTAGGCCCCAGCCTGGGCCCAGACCCATCCTCCCAGCCAGGTTT 
CCCTCCAGCAGGCTCCTTCCCTCCCTGTCACCTCCCTCTCACCAACCCGGGGTCTGAGCCCCTCATTCCT 
GACCGTCCGTGTTCTCAGGAGTGGTTGAGGACACAGGGCCCCAGCCCAGCCCTCTGCACCCCCCAGCCCG 
GCC^TCTGCGCCCCACAGCCCCTTTGGAGCTTTTCTCTTGTCCTCTCACTCCTTCCCAGAAGTTTTTGCA 
CAGAACTTCArTTTGAAAGTGTTTTTCT^TTCTCTATACCTCCCCC^GCTCTCCTCCAGCCCTTCC^ 
GGGCTCAGCCCTGCTGTCCTGAGCGTCTCCTGGGCCAGAGAGAGGAGATGGGGGTGGGAGGGACTGAGTT 
GATGTTGGGTTTTTCATTCAATAAATTGGTGATTTCTTACCGACAAAAAAAAAAAAAAA 

Human PACS-1 mRNA sequence - var4 (public gi: 34420884) (SEQ ID NO: 98) 

CGCCGCCGCCGCCGCGGGGGAAGCCTGGGAGCCAGATCGGCGTCGCCTCGGCCTCCGTAACCCCCGCCTA 
GCCGGGCCATGGCGGAACGCGGAGGGGCGGGCGGTGGTCCCGGAGGCGCCGGGGGCGGCAGCGGCCAGCG 
GGGATCCGGGGTCGCCCAGTCCCCTCAGCAGCCGATGGCGGAACGCGGAGGGGCGGGCGGTGGTCCCGGA 
GGCGCCGGGGGCGGCAGCGGCCAGCGGGGATCCGGGGTCGCCCAGTCCCCTCAGCAGCCGCCGCCGCAGC 
AGCAGCAGCAGCAGCCGCCGCAGCAGCCGACGCCCCCCAAGCTGGCCCAGGCCACCTCGTCGTCCTCGTC 
CACCTCGGCGGCGGCTGCCTCCTCCTCGTCCTCGTOTACCTCCACCTCCATGGCCGTGGCGGTGGCCTCG 
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GGCTCG3CGCCTCCCGGTGGCCCGGGGC 

CCTCGGAGGTGGACCGGAGCTOSTCCAGCTGCGTGCCT^^ 

CATGCTAAAAGAAATGGACAAAGATCTTAACTCAGTGGTCA 

ATTCTTCGCTCCAACGAGATCGTCCTTCGAGCTAGTGGACTGGTO 

CCCTTCAGTACCCTCATTTCCTTAAGCGAGATGCC^CAAGCTGCAGAT(^ 

TTACAAGAATCGGACCATCTTGGGCTATAAGACCTTGGCGGTGGGAOT 

CAGCATCCTAATGAAGGCGCACTGGTGCTTGGCCTACACAGCAACGTGAAGGA 

CAGAAATAAAGATCTACTCCCTGTCCAGCCAACCCATTGACCATGAA 

TCGTTCTCCTGATATTGACAATTATTCTGAGGAAGAGGAAGAGAGTTTCT 

GATGATCCATTGCATGGGCAGGACTTGTT CTACGAAGACGAAGAT CT CCGGAAAGTGAAGAAGACCCGGA 

GGAAACTAACCTCAACCTOTGCCATCACAAGGCAA.CCTAACATCAAACAGAAGCT 

GCGGTTTAAAGTTTC^GATGAGGTGGGCTTTGGGCTGGAGC^^ 

GAAGAGGACTTGGATGAATTGTATCACAGTCTGGAGAT^ 

AGGAGACAGAAAGCATCCTCAGC&CGCCAAAGCCCAAGCTCAAGCCTT^ 

X2AGCTCCCAGACGGAGATTGGCAGCCTCAACAGCAAAGGGA.GCCT 

GAATTGGCTGCTCTAGAAAAAATTAAATCTACTTGGATTAAAAACCAAGATGACAGGTTGACTGAAACAG 

ACACTCTGGAAATCACTGACCAGGACATGTTTGGAGATGCCAGCACGAGTCTGGTTGTGCCGGAGAAAGT 

CAAAACTCCCATGAAGTCCAGTAAAACGGATCTCCAGGGCTCTGCCTCCCCCAGCAAAGTGGAGGGGGTG 

CACACACCCCGGCAGAAGAGGAGCACXXICCCTGAAGGAGCGGCAGCTCTCCAAGCCCOTT^ 

CCAACAGTTCCGACAGCGAGCGCTCCCCAGATCrGGGCGACAGCACGCAGATTCGAA 

TGACCAGCTCAATCAGATCCTGGTGTCAGATG CAGCCCTC CCAGAAAATGTCATTCTGGTGAACACCACT 

GACTGGCAGGGCCAGTATGTGGCTGAGCTGCTCCAGGACCAGCGGAAGCCTGTGGTGTGCACCTGCTCCA 

CCGTGGAGGTCCAGGCCGTGCTGTCCGCCCTGCTCACCCGGATCCAGCGCTACTGCAACTGCAACTCTTC 

CATGCCGAGGCCAGTGAAGGTGGCTGCTGTGGGAGGCCAGAGCTACCTGAGCTCCATCCTCAGGTTCTTT 

GTCAAGTCCCTGGCCAAGAAGACCTCCGACTGGCTT(3GCTACATGCGCTTCCTCATCATCCCCCTCGGTT 

CTCACCCTGTGGCCAAATATTTGGGGTCAGTCGAGAGTAAATACAGTAGTTCCTTCCTGGATTCTGGTTG 

GAGAGATCTGTTCAGTCGCTCGGAGCCACCAGTGTCAGAGCAACTGGACGTGGCAGGGCGGGTGATGCAG 

TACGTCAACGGGGCAGCCACGACACACCAGCTTCCCGTGGCCGAAGCCATGCTGACTTGCCGGCATAAGT 

TCCCnt^TGAAGACTCCTATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGTGGGTCTGGTTGAAGA 

CTCTCCCTCCACAGCAGGCGATGGGGACGATTCTCCTGTGGTCAGCCTTACTGTGCCCTCCACATCACCA 

CCCTCCAGCTCGGGCCTGAGCCGAGACGCCACGGCCACCCCTCCCTCCTCCCCATCTATGAACAGCGCCC 

TGGCCATCGTGGGGAGCCCTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGTGGACTACTGGCTGGG 

CCACCCCGGGGAGCGGAGGAGGGAAGGCGACAAGAGGGACGCCAGCTCGAAGAACACCCTCAAGAGTGTC 

TTCCGCTCAGTGCACJGTGTCCCGCCTGCCCCATAGTGGGGAGGCCCAGCTTTCTGGCACCATGGCCATGA 

CTGTGGTCACCAAAGAACTGAACAAGAAAGTTCCCACCATCTTCCTGAGCAAGAAACCCCGAGAAAAGGA 

GGTGGATTCTAAGAGCCAGGTCATTGAAGGCATCAGCCGCCT 

ATGCTGAGAGTGTCCATCGATGGGGTCGAGTGGAGTGACATCAAGTTCTTCCAGCTGGCAGCCCAGTGGC 
CCACCCATGTCAAGCACTTTCCAGTGGGACTCTTCAGTGGCAGCAAGGCCACCTAG 

Human PACS-1 mRNA sequence - var5 (public gi: 6330230) (SEQ ID NO: 99) 

CTGCC!!ATCAGAAGGCAACCTAACATCAAACAGAAGTTTGTGGCCCTCCTGAAGCGGTTTAAAGTTTCAGA 

TGAGGTGGGCTTTGGGCTGGAGCATGTGTCCCGCGAGCAGATCCGGGAAGTGGAAGAGGACTTGGATGAA 

TTGTATGACAGTCTGGAGATGTACAACCCCAGCGACAGTGGCCCTGAGATGGAGGAGACAGAAAGCATCC 

TCAGCACGCCAAAGCCCAAGCTCAAGCCTTTCTTTGAGGGGATGTCGCAGTCCAGCTCCCAGACGGAGAT 

TGGCAGCCTCAACAGCAAAGGCAGCCTCGGAAAAGACACCACCAGCCCTATGGAATTGG 

AAAATTAAATCTACTTGGATTAAAAACGAA.GATGACAGCTTGACTGAAACAGACACTCTGGAAAT 

ACCAGGACATGTTTGGAGATGCCAGCACGAGTCTGGTTGTGCCGGAGAAAGTCAAAACTCCCATGAAGTC 

CAGTAAAACGGATCTCC^GGGCTCTGCCTCCCCCAGCAAAGTGGAGGGGGTGCACACACCCCGGCAGAAG 

AGGAGCACGCCCCTGAAGGAGCGGCAGCTCTCCAAGCCCCTAAGTGAGAGGACCAACAGTTCCGACAGCG 

AGCGCTCCCCAGATCTGGGCCACAGCACGCAGATTCa^AGAAAGGTGGTGTATGACCAGCTCAATCAGAT 

CCTGGTGTCAGATGCAGCCCTCCCAGAAAATGTCATTCTGGTGAACACCACTGACTGGCAGGGCCAGTAT 

GTGGCTGAGCTGCTCCAGGACCAGCGGAAGCCTGTGGTGTGCACCTGCTCCACCGTGGAGGTCCAGGCCG 

TGCTGTCCGCCCTGCTCACCCGGATCGAGCGOTACTGCAACTGCAACTCTTCGATGCCGAGGCCAGTGAA 

GGTGGCTGCTGTGGGAGGCCAGAGCTACCTGAGCTCCATCCTCAGGTT^ 

AAGACCTCCGACTGGCTTGGCTAGATGCGCTTCCTCATCATCCCCCTCGGTTCTCACCCTGTGGCCAAAT 
ACTTGGGGTCAGTCGACAGTAAATACAGTAGTTCCTTCCTGGATTCTGGTTGGAGAGATCTGTTCAGTCG 
CTCGGAGCCACCAGTGTCAGAGCAACTGGACGTGGCAGGGCGGGTGATGCAGTACGTCAACGGGGCAGCC 
ACGACACACCAGCTTCCCGTGGCCGAAGCCATGCTGACTTGCCGGCATAAGTTCCCTGATGAAGACTCCT 
ATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGTGGGTCTGGTTGAAGACTCTCCCTCCACAGCAGG 
CGATGGGGACGATTCTCCTGTGGTGAGCCTTACTGTGCCCTCCACATCACCACCCTCCAGCTCGGGCCTG 
AGCCGAGACGCCACGGCCACCCCTCCCTCCTCCCCATCTATGAGCAGCGCCCTGGCCATCGTGGGGAGCC 
CTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGTGGACTACTGGCTGGGCCACCCCGGGGAGCGGAG 
GAGGGAAGGCGACAAGAGGGACGCCAGCTCGAAGAACACCCTCAAGAGTGTCTTCCGCTCAGTGCAGGTG 
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TCCCGCCTGCCCGATAGTGGGGAGG 

AGAACAAGAAAGTTCCCACCATCTTCCTGAGCAAGA?VACCC CGAGAAAAGGAGGTGGATT CTAAGAGCCA 

GGTC^TTGAAGGCATCAGCCGCCTCATCTGCT^ 

GATGGGGTCGAGTGGAGTGACATCAAGTTCTTCCAGCTGGCA 

TTCCAGTGGGACTCTTCAGTGGCAGCAAGGCCACCTGAGGCCCTGTCT 

ACTGCCACCAGCCTCACCGCCTGttXXX^ 

CACCAGGGTCTGCOTCTCACTCGCCCAGGTCCCGAAGGACACTGCC^ 

CCTCCAGCCCACCCCTGCACAGCCCCTCCTCCTTCCCGCTTTTCCCCTTCT 

AGGCGTGTTGGTTTTGCCTTCTGGTGCC 

ACITCCAAACTOTTCCTTGTGGTATCAGTTTCCTTCT 

CAGGAGAGCCTAGTCCTCCCCCAGCCCCTCGkGC 

ACT(^CTCTGCTGCCTCCAGCAGGACCCAAGGCCACTTTCAACTCTTATGGGGTTOT 
AGCTTCCGAAGGGAGGGTAAGGGGGCACCCTGAGCCGACA 

GAGGCAGCTCCCCTGCCTCCAGGACCCTGTTGCTATGGTGACACAGCGTTTCTAGGACAGAGGGGCCTCC 

CAGTCTCCCCCCACCACCCGTGCACGACTTCCTC^ 

CTGAGTGTTTCTTTGGTTCTTTGCACGCCAAGTCT^ 

TCGCTGTCTTCGTGGCTTCCACCCTTGTTAATCATGCTCCTGCCTCTGCCTCCCAGCCCCTCACCCAG^ 

CAGCTCTGCCTGGACTTGGAGAGATGGGAGGCAGACCCC^ 

GACATTCTGTTTCATCTCCCATTCATCTCCCTC 

GTTCTTCCCCCTCCTTAGGCCCCAGCCTGGGC^ 

CTCCTTCCCTCCCTGTCACCTCCCTCT 

TCTC^GGAGTGGTTGAGGACACAGGGCCCCAGCCCAGCCCTCTGCACCCCCCAGCCCGGCCATCTGCGCC 

CCACAGCCCCTTTGGAGCTTTTCTCTTGTCCT 

TTGAAAGTGTTTTTCTGATTCTCCATACCTC^ 

GCTGTCCTGAGCGTCTCCTGGGCCAGAGAGAGGAGATGGGGGTGGGAGGGACTGAGTTGATGTTGGGTTT 
TTCATTCAATAAATTGGTGATTTCTTACCG 

Human PACS-1 mRNA sequence - var6 (public gi: 7022110) (SEQ ID NO: 100) 

CCCTAAGTGAGAGGACCAACAGTTCCGACAGCGAGCGCTCCCCAGATC 

AAGAAAGGTGGTGTATGACCAGCTCAATCAGATCCTGGTGTCAGATGCAGCCCTCCCAGAAAATGTCATT 
CTGGTGAAC^CCACTGACTGGCAGGGCCAGTATGTGGCTGAGCTGCTCGAGGACCAGCGGAAGCOT 
TGTGCACCTGCTCGACCGTGGAGGTCCAGGCCX5TGCT 
CAACTGCAACTCTTCCATGCCGAGGCC^GT^ 

ATCCTCAGGTTCTTTGTCAAGTCCCTGGCCAACATGACCTCCGACTGGCTTGGCTACATGCGCTTCOTCA 
TCATCCCCCTCGGTTCTCACCCTGTGGCCAAATACTTGGGGTCAGTCGACAGTAAATACAGTAGTTCCTT 
CCTGGATTCTGGTTGGAGAGATCTGTTCAGTCGCTCGGAGCCACCAGTGTCAGAGCAACTGGACGTGGCA 
GGGCGGGTGATGCAGTACGTCAACGGGGCAGCGACGACAGACGAGC 

CTTGCCGGCATAAGTTCCCTGATGAAGACTCCTATCAGAAGTTTATTCCCTTCATTGGCGTGGTGAAGGT 

GGGTCTGGTTGAAGACTCTCCCTCCACAGCAGGCGATGGGGACGATTCTCCTGTGGTCAGCCTTACTGTG 

CCCTCCACATC^CCACCCTC<^GCTCGGGCCTGAGCCGAGACGCCACGGCCACCCCTCCCTCCT 

CTATGAGC^GCGCCCTGGCCATCGTGGGGAGCCCTAATAGCCCATATGGGGACGTGATTGGCCTCCAGGT 

GGACTACTGGCTGGGCCACCCCGGGGAGCGGAGGAGGGAAGGCGACAAGAGGGACGCCAGCTCGAAGAAC 

ACCCTCAAGAGTGTCTTCCGCTCAGTGCAGGTGTCCCGCCTGCCCCATAGTGGGGAGGCCCAGCTTTCTG 

GCACCATGGCCATGACTGTGGTCACCAAAGAAAAGAACAAGAAAGTTCCCACCATCTTCCTGAGCAAGAA 

ACCCCGAGAAAAGGAGGTGGATTCTAAGAGCCAGGTCATTGAAGGCATCAGCCGCCTCATCTGTTCTTCC. 

CCCTCCTTAGGCCCCAGCCTGGGCCCAGACCCATCCTCCCAGCCAGGTTTCCCTCCAGCAGGCTCCTTCC 

CTCCCTGTCACCTCCCTCTGACCAACCCGGGGTCTGAGCCCCTCATTCCTGACCGTCCGTGTTCTCAGGA 

GTGGTTGAGGA(^CAGGGCCC(^GCCCAGCCCTOTGCACCCCCCAGCCCGGCCATCTGCGCCC(^CAGCC 

CCTTTGGAGCTTTTCTCTTGTCCTCTCACTCCTT^ 

GTTTTTCTGATTCTCCATACCTCCCCCAAGCTCT 

GAGCGTCTCCTGGGCCAGAGAGAGGAGATGGGGGTGGGAGGGACTGAGTTGATGTTGGGTTTTTCATTCA 
ATAAATTGGTGATTTCTTACCGAC 

Human PACS-1 protein sequence - varl (public gi: 7022111) (SEQ ID NO: 362) 

MPRPVKVAAVGGQS YLS S I LRFFVKSLANMTSDWLGYMRFL 1 1 PLGSHPVAKYLGSVDSKYS S S FLDSGW 
RDLFSRSEPPVSEQLJDVAGRVMQYWGAATTHQLPVAEA^^ 

SPSTAGDGDDSPWSLTVPSTSPPSSSGLSRDATATPPSSPSMSSALAIVGSPNSPYGDVIGIjQVDYWIjG 
HPGERRREGDKRDASSKNTLKSVFRSVQVSRLPHSGEAQLSGT^^ 

VDSKSQVIEGISRLICSSPSLGPSIiGPDPSSQPGFPPAGSFPPCHLPLTNPGSEPLIPDRPCSQEWLRTQ 
GPSPALCTPQPGHIiRPTAPLELFSCPLTPSQKFIiHRTSF 

Human PACS-1 protein sequence - var2 (public gi: 6330231) (SEQ ID NO: 363) 
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AITRQPNIKQKFVALLKRFKVSDEVGFGIiEHVSREQ 

STPKPKIjKPFFEGMSQSSSQTEIGSIiNSKGSIiGKDTTSPMEIiAALEKJKSTO 
QDMFGDASTSLVVPEKVKTPMKSSKTD^ 

RSPDLGHSTQ I PRKWYDQI*NQILVSDAALPENVI LVNTTDWQGQYVAELLQDQRKPVVCTCSTVEVQAV 
LS AIOiTRIQRYCNCNSSMPRPVKVAAVGGQS YLSS ILRFFVKSLANKTSDWLGYMRFIjI I PLGSHPVAKY 
LGSVDS KYS S S FLDSGWRDLFSRS E PPVS EQLDVAGRVMQYWGAATTHQLPVAEAMLTCRHKFPDEDS Y 
QKFIPFIGVVKVGIJVEDSPSTAGDGDDSPVVSIiTVPSTSPPSSSGLSRDATATPPSSPSMSSAIAIVGSP 
NSPYGDVIGLQVDYWLGHPGERRREGDKRDASSKNTLK^ 

NKKVPTI FIjSKKPREKEVDSKSQVIEGI SRLICSAKQQQTMIiRVS IDGVEWSDI KFFQLAAQWPTHVKHF 
PVGLFSGSKAT 

Human PACS-1 protein sequence - var3 (public gi: 34420885) (SEQ ID NO: 364) 

MAERGGAGGGPGGAGGGSGQRGSGVAQSPQQPPPQQQQQQPPQQPTPPKLAQATSSSSSTSAAAASSSSS 

ststsmavavasgsappggpgpgrtpapvqmnlyatwevdrsssscvprlfsltl 

WI AVKLQGS KRI LRSNE I VLPASGLVETELQLTFSLQYPHFLKRDANKIjQ IMLQRRKRYKNRTI LG YKT 

IiAVGLINMAEVMQHPNEGALVLGIiHSNVKDVSVPVAEIKIYSLSSQP 

EEESFSSEQEGSDDPLHGQDLFYEDEDIiRKVKKTRRKLTSTSAITRQPNIKQKFVAIjIj 

lehvsreqireveedldelydslemynpsdsgpemeetesilst^ 

KGSLGKDTTSPMEIiAALEKI KSTWI KNQDDSLTETDTLE I TDQDMFGDAST SL WPE KVKTPMKS S KTDIi 

QGSASPSKVEGVHTPRQKRSTPIiKERQLSKPLSERTNSSDSERSPDLGHSTQIPRKVVYDQLNQILVSDA 

ALPENVIIiVNTTDWQGQYVAELIjQDQRKPVVCTCSTVEVQAVLSAIiIjTRIQR 

GQSYLSSimFFVKSIANKTSDWLGYmFLIIPL^ 

SEQLDVAGRVMQYWGAATTHQLPVAEAMLTC^ 

PWS LTVPSTS PPS SSGLSRDATATPPS S PSMNSAIiAI VGS PETS PYGDVI GLQVDYWXiGHPGERRREGDK 

?RDASSKNTIiKSVFRSVQVSRLPHSGEAQLSGTMAMTVVTKELNK^ 

SRIjICSAKQQQTMLRVSIDGVEWSDIKFFQLAAQWPTHVKHFPVGLFSGSKAT 

Human PACS-1 protein sequence - var4 (public gi: 33243995) (SEQ ID NO: 365) 

ESILSTPKPKLKPFFEGMSQSSSQTEIGSIjNSKGSLGKDTTSP 
EITDQDMFGDASTSLVVPEKAnCTPMKSSKTDIiQGSASPSKVE 

SDSERSPDIiGHSTQIPRKWYDQLNQILVSDAALPENVILVNTTDWQGQWAELIiQDQRKP 
VQAVLSALLTRIQRYOtfCNSSMPRPVKVAAVGGQSYLSSIL^ 

VAKYIjGSVDSKYSSSFLDSGWRDLFSRSEPPVSEQLDVAGRVMQYVNGAATTHQLPVAEAI^TCRHKFP^ 
EDSYQKFIPFIGVVKVGLVEDSPSTAGDGDDSPWSLTVPSTSPPSSSGLSRDATATPPSSPSMSSAIiAI 
VGSPNSPYGDVIGLQVDYWLGHPGERRREGDKRDASSKNTLKSVFRSVQVSRLPHSGEAQLSGTMT^MTVV 
TKEKNKKVPTIFDSKKPREKEVDSKSQVIEGISRLICSAKQQQTMIjRVSIDGVEWSDIKF 
VKHFPVGLFSGS KAT 

Human PACS-1 protein sequence - var5 (public gi: 30962846) (SEQ ID NO: 366) 

MAERGGAGGGPGGAGGGSGQRGSGVAQSPQQPPPQQQQQQPPQQPTPPKLAQATSSSSSTSAAAASSSSS 

STSTSMAVAVASGSAPPGGPGP.GRTPAPVQM^YATWEVDRS 

VVIAVKLQGS KRI LRSNE IVLPASGIjVETEIjQLTFSLQYPHFLKRDANKL^ 

LAVGLINMAEVMQHPNEGALVIjGLHSNVKDVSVPVAEIKIYSLSSQPIDHEGIKSKL 

EEESFSSEQEGSDDPIiHMDLFYEDEDIiRKVKKTRRKLTSTSAITRQPNIKQKF^ 

LEHVSREQIREVEEDLDEIjYDSLEMYNPSDSGPEMEETESILSTPKPKIjKPFFEGMSQSSSQTEIGSIiNS 
KGSLGKDTTS PMELAALE KI KSTWI KNQDDS LTETDTLE ITDQDMFGDAS TSLWPEKVKTPMKSSKTDL 
QGSASPSKVEGVHTPRQKRSTPLKERQLSKPLSERTNSSDSERSPDIiGHSTQIPRKVVYDQLNQILVSDA 
ALPENVILVNTTDWQGQYVAELLQDQRKPWCT^ 

GQSYLSSILRFFVKSLANKTSDWLGYl^FLIIPLGSHPVAKYLGSVDSKYSSSFLDSGWRDLFSRSEPPV 
SEQLDVAGRVMQYVNGAATTHQLPVAEAMLTCRHKFPDEDS YQKFI PFIGWKVGLVEDS PS TAGDGDDS 
PWSLTVPSTSPPSSSGLSRDATATPPSSPSMSSAIjAIVGSPNSPYGDVTGL^^ 
RDASSKNTLKSVFRSVQVSRIiPHSGEAQLSGTMAMTWTKEKNKKVPTIF^ 
SRLI CSAKQQQTMLRVS IDGVEWSDI KFFQIiAAQWPTHVKHFPVGLFSGSKAT 

Unigene Name: PPP1CA Unigene ID: Hs. 183994 

Human PPP1 CA niRNA sequence - varl (public gi: 287796) (SEQ ID NO: 101) 

GCAAGGAGCTGCTGGCTGGACGGCGGCATGTCCGACAGCXjAG 

GCCTGCTGGAAGTGCAGGGCTCGCGGCCTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCGCGGTCT 
GTGCCTGAAATCCCGGGAGATTTTTCTGAGCCAGCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATC 
TGCGGTGACATACACGGCCAGTACTACGACCTTCTGCGACTATTTGAGTATGGCGGTTTCCCTCCCGAGA 
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gcaactacctctttctgggggactatgtc 

ggcctataagatcaagtaccccgagaacttcttcct^ 

cgcatctatggtttctacgatgagtgcaagagacgct 

gcttcaactgcctgcccatckx:ggcca^ 

ggacctgcagtctatggagcagattcggcgga 

tgtgacctgctgtggtctgaccctgacaaggacgtgcagggct 

ttacctttggagccgaggtggtggccaagttcctc 

ccaggtggtagaagacggctacgagttctttgccaag 

tactgtggcgagtttgacaatgctggcgccatgatgagtgtggacgagaccctgatgtc 

tcctcaagcccx3ccx^caagaacaaggggaagtacgggcagttca 

catcaccccaccccggaattccgccaaagccaagaaatagcccccgcac^ 

tggattgattgtagagaaatcatgctgccatgctgggggggggtcacccc^ 

cacggggaacatggagccttggtgtatttttcttttc 

cccagggctgcttcctgcot3cacctgc 

gcaacggcaggccaggtcgtgggtctccagccgtgcttggcct 

acccatctggtctcttgaataaaggtcaaagctggatcggaatc 

Human PPP1CA mRNA sequence - var2 (public gi: 21758300) (SEQ ID NO: 102) 

AAAAAAAAAAAAAGTTTTCCCTCCATGAGGCAGCGCGCCGACCGCCGAAGCATGGTCTC(^CCAGCGGCG 
CCGCCACCTCCAGCGTCCTCGGCAGGGAGTTGTGGTGCCGTAGAGGGCGGTCCCCGCGGCCCACGCCGCA 
CACCACCTGGGC7VGGGGGAGAOTCAGGGGGAGGCCCACACAOT 

CCTTGTGCCAAAATTCAGACCAGACCCCTCACTGGACATTCAAGAAGCCCCGTCCTCCAACGTGTCOT 

ATTGCACACGAGCTCTCCCTGCCACTCCCCATCTG^ 

<3CTTCCAGGCCCAGCTGGGGTCCCCCTCCAGGATGGCTCCT 

GTGCCCC^CCCTAGC^TCTCCCTGGGGCGCACCTTTCCCTACCCCACTGGAGCTCCCTGAGGG(^VGGGTC 
GAATCTCTCCCTCTCAGTGTAGCCTAGAGCGGGGTACTCAGGAGGGTCCGTAAGCCTTCCTGACTCTCCA 
GCTTAGAGGCCCCTCCTGAAGGCGTCCAGG<^CTAGAGGOT 

TGGGCAAGAGCTCTCTGGGAAGACGGGGAGGTCTAAGGCCAGCACAGAGTGGCCAGAGGGCCACACCAAA 
CTCCCATCCCTGGTCAGCCCAGGTGGCTCTGACCTGAGCAGGGCAGCTGGGGAGGTGGGTACACAGCCTC 
CACCAGGACACTCTCCTCCTCCTCCAGCTT^ 

TCTGCCCGCGGGTGCAGACGCCATGCCTGCCGCCCCCGCCAGCGCCAGCCACTGAGCTTCACAGCTACCT 
GCAGCAAGGAGGGGAAAGGGGCCTCCTGGACACCACCCCAGGTAOTGCAGGGTG 

GGAGCCGTGCAGGGCTCGCGGCCTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCGCGGTCTGTGCC 
TGAAATCCCGGGAGATTTTTCTGAGCCAGCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATCTGCGG 
TGACATACACGGCCAGTACTACGACCTTCTGCGACTATTTGAGTATGGCGGTTTCCCTCCCGAGAGCAAC 
TACCTCTTTCTGGGGGACTATGTGGACAGGGGCAAGCAGTCCTTGGAGACCATCTGCCTGCTGCTGGCCT 
ATAAGATCAAGTACCCCGAGAACTTCTTCCTGCTCCGTGGGAACCACGAGTGTGCCAGCATCAACCGCAT 
CTATGGTTTCTACGATGAGTGCAAGAGACGCTACAACATCAAACT 

AACTGCCTGCCGATCGCGGCCATAGTGGACGAAAAGATCTTCTGCTGCCACGGAGGCCTGTCCCCGGACC 
TGCAGTCTATGGAGCAGATTCGGCGGATCATGCGGCCCACAGATGTGCCTGACCAGGGCCTGCTGTGTGA 
CCTGCTGTGGTCTGACCCTGAGAAGKSACGTGCAC^ 

TTTGGAGCCGAGGTGGTGGCCAAGTTCCTCCACAAGCACGACTTGGACCTC^ 
TGGTAGAAGACGGCTACX^GTTCTTTGCCAAGC^^ 

TGGCGAGTTTGACAATGCTGGCGCCATGATGAGTGTGGACGAGACCCTCATGTGCTCTTTCCAGATCCTC 

AAGCCCGCCGACAAGAACAAGGGGAAGTACGGGCAGTTCAGTGGCCTGAACCCTGGAGGCCGACCCATCA 

CCCC^CCCCGC^TTCCGCCAAAGCC^GAAAT^^ 

TGATTGTACAGAAATGATGCTGCCATGCTGGGGGGGGGTCACC^ 

GGAAC^TGGAGCCTTGGTGTATTTTTCTTTTCTTTTTTTAATGAATCAATAGCA 

GGCTGCTTCCTGCCTGCACCTGCGGTGACTGTGAGCAGGATCCTGGGGCCGAGGCTGCAGCTCAGGGCAA 
CGGCAGGCCAGGTCGTGGGTCTCCAGCCGTGCTTGGCCTCAGGGCTGGCAGCCGGATCCTGGGGCAACCC 
ATCTGGTCTCTTGAATAAAGGTCAAAGCTGGATTCTCGC 

Human PPP1 CA mRNA sequence - var3 (public gi: 14124967) (SEQ ID NO: 103) 

GGCTGCCGGAGGGCGGGAGGCAGGAGCGGGCGAGGAGCTGCTGGGCTGGAGCGGCGGCGCCGCCATGTCC 

.GA<^GCGAGAAGGTC^CCTGGACTCGATCATCGGGCGCCTGCTC^AAGTGCAGGGCTCGCGGCCTGGCA 

AGAATGTACAGCTGAC^GAGAACGAGATCCGCGGTCTGTGCCTGAAATCCCGGGAGATTTTTCTGAGCCA 

GCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATCTGCGGTGACATACACGGCCAGTACTACGACCTT 

CTGCGACTATTTGAGTATGGCGGTTTCCCTCCCGAGAGGAACTACCTCTTTCTGGGGGACTATGTGGACA 

GGGGCAAGGAGTCCTTGGAGACCATCTGCCTGCTGCTGGCCTATAAGATCAAGTACCCCGAGAACTTCTT 

CCTGCTCCGTGGGAACCACGAGTGTGCCAGCATCAACCGC 

CGCTAGAACATCAAACTGTGGAAAACCTTCACTGACTGC 

ACGAAAAGATCTTCTGCTGCCACGGAGGCCTGTCCCCGGACCTGCAGTCTATGGAGCAGATTCGGCGGAT 
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CATGCGGCCCACAGATGTGCCTGACCAGGGCCTC 
GTGCAGGGCTGGGGCGAGAACGACCX3TGGCX5TCTCTTTTA 

T C CACAAGCACGACTTGGACCTCATCTGCCGAG CACACCA.GGTGGTAGAAGACGGCTACGAGTTCTTTGC 

GAAGCGGCAGCTGGTGACACTTTTCTC71GCT 

ATGAGTGTGGACXSAGACCCTCATGTGCTCTTTCC^G 

ACGGGCAGTTCAGTGGCCTGAACCCTGGAGGCCX^ 

GAAATAGCCCCCGCACACCACCCTGTGCCCCAGATGATGGATTGATTGTACAG^ 
TGGGGGGGGGTCACCCCGACCCCTCAGGCCCACCTGTGACGGGGAACATGGAGCCT 

CTGTGAGCAGGATCCTGGGGCCGAGGCTGCAGCTC^ 
GTGCTTGGCCTCAGGGCTGGGAGCCGGATCCTGGGGCA 

TGGATTCTC^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human PPP1CA mRNA sequence - var4 (public gi: 33872 852) (SEQ ID NO: 104) 

CCTCGTGCCGAATTCXK3CACGAGGAGCGGGCCAGGAGCT 
GACAGCGAGAAGCTCAACCTGGACTCGATCATC^ 

AGAATGTACAGCTGACAGAGAACGAGATCCGCGGTCTGTGCCTGAAATCCCGGGAGATTTTTCTGAGCCA 

GCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATCTGCGGTGAGATACACGGCC^ 

CTGCGACTATTTGAGTATGGCX^TTTCCCTCCCGAGAGCAACTACCrCTTTCTGGGGGACTATGT 

GGGGCAAGC^GTCCTTGGAGACCATCTGCCTGCTGCTGGCCTATAAGATCAAGTACCCCGAGAACTTCT 

CCTGOTCCGTGGGAACCACGAGTGTGCCAGCATGAACCGCATCT 

CGCTACAACATCAAACTGTGGAAAACCTTCACTGACTGCTTCAA 

ACGAAAAGATCTTCTGCTGCCACGGAGGCCTGTCCCCGGACCTGCAGTCTATGGAGCAGATTCGGCGGAT 
CATGCGGCCCACAGATGTGCCTGACCAGGGCCTGCTGTGTGAC 

GTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACOTTTGGAGCCGAGGTGGTGGCCAAGTTCC 
TCCACAAGCACGACTTGGACCTCATCTGCCGAGCACACCAGGTGGTAGAAGACGGCTACGAGTTCT 
CAAGCGGCAGCTGGTGACACTTTTCTCAGCTCCCAACTACTGTGG 
ATGAGTGTGGACGAGACCCTCATGTGCTCTTTCCAGATCCTCA^ 

ACGGGCAGTTCAGTGGCCTGAACCCTGGAGGCCGACCCAT(^CCCCACCCCGCAATTCCGCCAAAGCCAA 

GAAATAGCCCCCGCACACCACCCTGTGCCCCAGATGATGGATTGATTGTACAGAAATCATGCTGCCATGC 

TGGGGGGGGGGTCACCCCGACCCCTCAGGCCCACCTGTCACGGGGAAGATGGAGCCTTGGTGTAT 

TTTTCTTTTTTTAATGAATCAATAGCAGCGTCCAGTCCCCCAGGGCTGCTTCCTGCOT 

ACTGTGAGCAGGATCCTGGGGCCGAGGCTGCAGCTCAGGGCAACGGCAGGCCAGGTCGTGGGTCTCC^ 

CGTGCTTGGCCTCAGGGCTGGCAGCCGGATCCTGGGGCAACCCATCTGGTCTCTTGAATAAAG^ 

CTGGATTCTCGAAAAAAAAAAAAAAAAAA 

Human PPP1CA mRNA sequence - var5 (public gi: 12804878) (SEQ ID NO: 105) 
gaggagcgggccaggagctgctgggctggagcggcggcgccgc 

ggactcgatcatcgggcgcctgctggaagtgcagggctcgcggcctggcaagaatgtacagctgac^gag 

aacgagatccgcggtctgtgcctgaaatcccgggagatttttotgagccagcccattcttctggagctgg 

aggcacccctcaagatctgcggtgacatacacggccagtactacgaccttctgcgactatttgagtatgg 

cggtttccctcccgagagcaactacctcittctgggggactatgtggacaggggcaagcagtccttggag 

accatctgcctgctgctggcctataagatcaagtaccccgagaacttcttcctgctccgtgggaaccacg 

agtgtgccagcatcaaccgcatctatggtttctacgatgagtc 

gaaaaccttcactgactgctcc^ctgcct^ 

cacggaggcctgtccccggacctgcagtctatggagcagattcggcggatcatgcx5gcccacagatgtgc 
ctgaccagggcctgotgtgtgacctgctgtggtctgaccctgacaaggacgtgcagggctggggcgagaa 
cgaccgtggcgtctcttt^acctttggagccgaggtggtggcg^ 
ctcatctgccgagcacaccaggtggtagaagacggct^^ 

ttttctcagctcccaactactgtggcgagtttgacaatgctggcgccatgatgagtgtggacgagaccct 
catgtgctctttccagatcctcaagcccgccgacaagaacaaggggaagtacgggcagttcagtggcctg 
aaccctggaggccgacccatcaccccaccccgcaattc^ 

ccctgtgccccagatgatggattgattgtacagaaatcatgctgccatgctgggggggggtcaccccgac 

•ccctcaggcccacctgtgacggggaacatggagcct 

att^gcagcgtccagtcccccagggctgcttcctgcct 

ccgaggctgcagctcagggcaacggcaggccaggtcgtgggtct 

c^gccggatcctggggcaacc<^tctggtctctt^^ 

aaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

Human PPP1CA mRNA sequence - , var6 (public gi: 34534606) (SEQ ID NO: 106) 

CTTCTTGCTGACGCCGC<^GCX5CCGAC^ 

CGAAGC^TGGTCTCCACC^GCGGCGCCGCCACCGCCTCGTCGGCCGCCGGCCCCAGCCGCGGCGCGGCCC 
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ACAGCCCCTCCAGCGCGCCGGACGCCTCCAGACACAGGCCGCCGTTC^^ 

CCAGCAGCGCCAGGACTCTGTCCACCACTGC^ 

CCCCXGCCAGCGCa\GCCACTGAGCTTCACAGCTACCT 

CCACCCCAGGTACTGCAGGGTGGGGCA.CTTCCGCCACAGGAGCCGTGCAGGGCT 

TGTACAGCTGACAGAGAACGAGATCCGCGGTCTGTC 

ATTCTTCTGGAGCTGGAGGCACCCCTCAAGATCTGCGGTGACAT^ 

GACTATTTGAGTATGGCGGTTTCCCTCCCGAGAGCAACTACCTCTTTCTGGGGGACTATG 

CAAGCAGTCCTTGGAGACCATCTGCCTGCTGCT 

CTCCGTGGGAACCACGAGTGTGCCAGCATCAACCGCAT 

ACAACATCAAACTGTGGAAAACCTTCACnHSACTGCTTCAACTGCCTGCCCAT 

AAAGATCTTCTGCTGCCACGGAGGCCTGTCCCCGGACCTGCAGTCTATGGAGCAGAT 

CGGCCC^CAGATGTGCCTGACCAGGGCCTGCTGTGTGACCTGCTGTGGTCTGACCCTGACAAGGA 

AGGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACCTTTGGAGCCGAGGTGGTGGCCA^ 

CAAGCACGACTTGGACCTCATCTGCCGAGCACACCAGGTGGTAGAAGA 

CGGCAGCTGGTGGCACTTTTCTCAGCTCCCAACTACTGTGGCGAGTTTGACAA.TGCT 

GTGTGGACGAGACCCTCATGTGCTCTTTCCAGATCCTCAAGC^ 

GCAGTTCAGTGGCCTGAACCCTGGAGGCTGACCCATCACCCCACCCCGCAATTCCGCCAAAG 
TAGCCCCCGCACACCACCCTGTGCCCCAGATGATGGATTGATTGTACAGAAATCATGCTGCCA.TGCTGGG 
GGGGGGTCACCCCGACCCCTCAGGCCCACCTGTCACGGGGAACATGGAGCCTTGGTGTATTTTTCTTTTC 
TTTTTTTAATGAATCAATAGCAGCGTCCAGTCC^ 

GAGCAGGATCCTGGGGCCGAGGCTGCAGCTCAGGGCAACGGCAGGCCAGGTC 

TTGGCCTCAGGGCTGGCAGCCGGATCCTGGGGCAACCCATCTGGTCTCTTGAATAAAGGTCAAAGCT 
TTCTC 

Human PPP1CA mRNA sequence - var7 (public gi: 3058209 6) (SEQ ID NO: 107) 

ATGTCCGACAGCGAGAAGCTCAACCTGGACTCGATCATCGGGCGCCTGCTGGAAGTGCAGGGCTCGCGGC 
CTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCGCGGTCTGTGCCTGAAATCCCGGGAGATTTTTCT 
GAGCCAGCCCATTCTTCTCGA(3CTGGAGGGACCCCTCAAGAT^ 
GACCTTCTGCGACTATTTGAGTATGGCGGTTTC 

TGGACAGGGGCAAGCAGTCCTTGGAGACCATCTGCCTGCTGCTGGCCTATAAGATCAAGTACCCCGAGAA 
CTTCTTCCTGCTCCGTGGGAACCACGAGTGTGCCAGC^TCAACCGCATCTATGGTTTCTACGATGAGTGC > 
AAGAGACGCTACAACATCAAACTGTGGAAAACCTTCACT 
TAGTGGACGAAAAGATCTTCTGCTGCGA(X3GAGGCCTGTCCCCGGAC^ 

GCGGATCATGCGGCCCACAGATGTGCCTGACCAGGGCCrGCTGTGTGACCTGCTGTGGTCTGACCCTGAC 
AAGGACGTGC^GGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACCTTTGGAGCCGAGGTGGTGGCCA 
AGTTCCTCCAG^GCACXSACTTGGACCTCATCTGCCGA^ 

CTTTGCCAAGCGGCAGCTGGTGACACTTTTCTCAGCTCCCAACTACTGTGGCGAGTT 
GCCATGATGAGTGTGGACGAGACCCTGATGTGCTCTTTCCAGATCCTCAAGCCCGCCGACAAGAACAA 
GGAAGTACGGGCAGTTCAGTGGCCTGAACCCTGGAGGCCGACCCATCACCCC^CCCCGCAATTCCGCCAA 
AGCCAAGAAATAG 

Human PPP 1 CA mRNA sequence - var8 (public gi: 190515) (SEQ ID NO: 108) 

GGGCAAGGAGCTGCTGGCTGGACGGCGGCATGTCCGAC^GCGAGAAGCTCAACCTGGACTCGATCATC 
GCGCCTGCTGGAAGTGCAGGGCTCGCGGCCTGGCAAGAATGTACAGCTGACAGAGAACGAGATCCGCGGT 
CTGTGCCTGAAATCCCGGGAGATTTTTCTGAGCCAGCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGA 
TCTGCGGTGACATACACGGCCAGTACTACGACCTT^ 

GAGCAACTACCTCTTTCTGGGGGACTATGTGGACAGGGGCAAGCAGTCCTTGGAGACCATCTGCCTGCTG 

CTGGCCTATAAGATCAAGTACCCCGAGAACTTCTTCCTGCTCCGTGGGAACCACGAGTGTGCCAGCATCA 

ACCGCATCTATGGTTTCTACGATGAGTGCAAGAGACGCTACAACATCT^AACTGTGGAAAACCTTCACTGA 

CTGCTTCAACTGCCTGCCC^TCGCGGCCATAGTGGACGAAAAGATCTTCTGCrGCCA 

CCGGACCTGCAGTCTATGGAGGAGATTCGGCGGATCATGCGGCCCACAGATGTGCCTGACCAGGGCCTGC 

TGTGTGACCTGCTGTGGTCTGACCCTGACAAGGACGTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTC 

TTTTACCTTTGGAGCCGAGGTGGTGGCCAAGTTCCTCCACAAGCACGACTTGGACCTCATCTGCCGAGCA 

GACCAGGTGGTAGAAGACGGCTATGAGTTCTTTGCCAAGCGGCAGCTGGTGACACTTTTCTCAGCTCCCA 

ACTACTGTGGCGAGTTTGACAATGCTGGC^CCATGATGAGTGTGGACX5AGACCCT(^TGTGCTCTTTC(^ 

GATCCTCAAGCCCGCCGACAAGAACAAGGGGAAGTACX-KS^ 

CCCATCACCCCACCCCGCAATTCCGCC^AAGCC^ 

■GATGGATTGATTGTACAGAAATCATGCTGCC^^ 

GTCACGGGGAACATGGAGCCTTGGTGTATTTTTCTTTT 

CCCCCAGGGCTGCTTCCTGCCTGCACCTGCGGTACTGTGAGCAGGATCCTGGGGCCGAGGCTGCAGCTCA 
GGGCAACGGCAGGCCAGGTCGTGGGTCTCCAGCCGTGCTTGGCCTCAGGCTGGCAGCCCGGATCCTGGGG 
CAACCCATCTGGTCTCTTGAATAAAGGTCAAAGCTGG 
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Human PPP1CA mRNA sequence - var9 (public gi: 190280) (SEQ ID NO: 109) 

CGGCCTGGCAAGAATGTACAGCTGAGAGAGAACG 

TTCTGAGCCAGCCCATTCTTCTGGAGCTGGAGGCACCCCTCAAGATCTGC 

CTACGACCTTCTGCGACTATTTGAGTATGGAGGTTTC^ 

TATGTGGACAGGGGCAAGCAGTCCTTGGAGACCATCTGCCTGC^ 

AGAACTTCTTCCTX3CTCCGTGGGAACCACGAGTGTGCCAGCA 

GTGCAAGAGACGCTACAACATCAAACTGTGGAAAACCTTCACTGAOT 

GCGATAGTGGACGAAAAGATCTTCTGCTGCCACGGAGGCCT^ 

TTCGGCGGATCATGCGGCCCACAGATGTGCCTGACCAGGGCCTGCTGTGTGACCT 

TGACAAGGACGTGCAGGGCTGGGGCGAGAACGACCGTGGCGTCTCTTTTACCT 

GCCAAGTTCCTCGACAAGCACGACTTGGACCTGATCTGCCGAGCACACCAGG 

AGTTCTTTGCCAAGCGGCAGCTGGTGACACTTTTCTCAGCTCCCAACTACTC 

TGGCGCCATGATGAGTGTGGACGAGACCCTCATGTGCTCTTTCCAGATC 

AAGGGGAAGTAttXSGCAGTTCAGTGGCCTGAACCCTGGAGGC^ 

CCAAAGCCAAGAAATAGCCCCCGCACACCACCCTGTGCCCCAGATGATGGA 

GCTGCCATGCTGGGGGGGGGTCACCCCGACCCCTCAGGCC 

TATTTTTCTTTTCTTTTTTTAATGAATCAG 

Human PPP1CA protein sequence - varl (public gi: 2 98964) (SEQ ID NO: 261) 

MSDSEKIdgiJDSI IGRLLEGSRVLTP^ 

•KICGDIHGQYYDLLRLFEYGGFPPESN^ 

INRIYGFYDECKRRYNIKLWKTFTDCFNCLPI^ 

LLCDLLWSDPDKDVQGWGENDRGVS FTFGAE WAKFLHKHDLDLI CRAHQWEDGYEFFAKRQLVTIjFS A 
PNY CGEFDNAGAMMSVDETLMCS FQI LKPADKNKGKYGQFSGLNPGGRP I TPPRNSAKAKK 




Human PPP 1 CA protein sequence - var3 . (public gi: 1 9 0 2 8 1) (SEQ ID NO- 263) 

RPGKNVQLTENE I RGLCLKSRE I FLSQPI LIiELEAPLKI CGD I HGQ YYDLLRLFE YGGFPPESNYLFLGD 
YVDRGKQSLETICLLLAYKIKYPENFFLLRGmE(^INRIYGFYDECKRRYNIKLWKTFTDCFNCLPIA 

AIVDEKJIFCCHGGLSPDLQSMEQIRRimPTDVPDQGLLCDLIiWSDPDKDVQGWGENDRGVSFTF 
AKFMKHDUJMCRAHQVVEDGTO^ 

KGKYGQFSGLNPGGRP I TPPRNSAKAKK 

Human PPP1CA protein sequence - (public gi: 35451) (SEQ ID NO: 395) 

MSDSEKLNIiDSIIGRIjLEVQGSRPGKNVQLTENEI^ 
DLLRLFEYGGFPPESNYLFLGDYVDRGKQSLETICL^ 

K^YNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDLQSMEQIRRIMRPTDVPDQGLLCD^ 
•KDVQGWGENDRGVS FTFGAEVVAKFLHKHDIiDLI CRAHQWEDGYEFFAKRQLVTLFSAPNYCGEFDNAG 
AMMSVDETLMCSFQILKPADKNKGKYGQFSGLNPGGRP I TPPRNSAKAKK 

Human PPP1CA pray sequence - varl (SEQ ID NO: 110) 

CCGCCTGGTNCTACCCATGACNCACOT 
ATTCCACACACAATACAAGTGCGCTCATCGACACGCA 

TAGGGCNAGNGCCCCQTTGGANTTCCNNTACAACNTNCCAGGATNACGCTCATATGGCC^ 

TGAATTCCACCCAAGCGGTGGTATCAACGCACAGTGGCCATTATGGCGGGCAGTGGCCA^ 
<3CC<^CCCATCACCC<^CCCCG^^ 

TNNATGATGGATTGATTGTACAGAAATCATGCTGCCATGCTGGGGGGGGG 

Unigene Name; PRKAR1A Unigene IDs Hs • 280342 

Human PRKAR1A mRNA sequence. - varl (public gi: 34530409) (SEQ ID NO- 1 1 1) 

ATCGCAGAGTGGAGCGGGGCTGGGAGCAAAGCGCTGAGGGAGCTCGGTACGCCGC^ 
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GCCTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCATG<^^ 

ACGCAGCCTTCGAGAATGTGAGCTCTACGTCGA.GAAGCATAA 

<5TGCAGTTGTGCACTGCTCGACCTGAGAGACCCATGGCM 

AAAACAGATT CAGAATCTGCAGiUU^GCAGGCACTCGTACAGACT CAAGGGAGGATGAGATTTCTCCTCCT 

CCACCC^CCCAGTGGTTAAAGGTAGGAGGCGACGAGGTGCTATC^ 

ATGCGGCATCCTATGTTAGAAAGGTTATACC^ 

TGAAAAGAATGTGCTGTTTTCACATCTTGAXGATAATGAGAGAAGTGATATTTTTGA 

GTCTCCTTTATCGCAGGAGAGACTGTGATTCAGCAAGGTGATGAAGG 

AAGGAGAGACGGATGTCTATGTTAACAATG^ 

ACTTGCTTTGATTTATGGAACACCGAGAGCAGCCACTGTCAAAGCAAAGACAAATGT 
ATCGACCGAGACAGCTATAGAAGAATCCTCATGGGAAGCACACT 

TCCTTAGTAAAGTCTCTATTTTAGAGTCTCTGGACAAGTGGGAACGTCTTACGGTAGCT^ 
ACCAGTCCAGTTTGAAGATGGGCAGAAGATTGTGGTGCAGGGAGAACCAGGGGA 

TTAGAGGGGTCAGCTGCTGTGCTACAACGTCGGTCAGAAAATGAAGAGTTTGTTGAAGTGGGAAGATTGG 

GGCCTTCTGATTATTTTGGTGAAATTGCACTACTGATGAATCGTCCTCGTGCTGCCACAGTTG 

TGGCCCCTTGAAGTGCGTTAAGCTGGACCGACCTAGATTTGAACGTGTTCTTGGCCC^ 

CTCAAACGAAACATCCAGCAGTACAACAGTTTTGTGTCA 

CCTTTTCTCCTCTCCCCAATCCATGCTTCACT 

GTGGCC^CTGGCATCGCAGCTTCCTGTOTGTTTATATATTGAAAGTTGCTTTTATT 
TTGGAGCATTAACTAAATGCTCATACACAGTTAAATAAATAGAAAGAGTTCTATGGAGACTTTGCT 

CTGCTTCTCTTTGTGCAGTGTTAGTATTCACCC 

CCGAGCACCTATTGAATTACGATAGAGTAATG 

TTGTCC^GTTATAAGCGTATTTAGACTGTGGCGATATATC 

CATTAAAOTCTAAAGATTAGGGAAAATGGATATAGAAAATCTTAGTATAGTAGAAAGACATCTGCCTGTA 
ATTAAACTAGTTTAAGGGTGGAAAAATGCCCATTTTTGCTAATTATCAATGGGATATGATTGGTTCAGTT 
TTTTTTTTTTTCCAGAGTTGTTGTTTGCC^ 

TTTCTTCTCCAATTCTGAAATACTTTTGAGTATGGCTATCTATACCTGCCTTTTAAGTTTGAAACTAACT 

GATAGATTGCAAATATTGGTTAGTATTTAACTA 

TCCAGCTAGTGCCAAATAATTGATCAGATGCTGAATTGAGJ^ 

GTTAATTTAGAGCGTTTGGTTAAAGTATGTCCTTCAGCTGACTCCAGTATAATCTCCTCTGCTCATTAAA 
CTGATTCCAGGAGATTGGATTTGCTGTGACTAGATACAGATGGAGCAAATGTCCTAACAGAGAAATAGAG 
GTGATG CTGCTAAAGGGAGAAATGCCAGGCGGACAAAGTTCAGTGT CGGGAATTTTC CC CGTGACATTCA 
CTGGGGCATGAGATTTTGGAAGAAGTTTTTTACTC 

TAGAATTTCTGGTGGGTTGATGGTAGGGTATAATGTGTCTGTGTTGCTTCAAATTGGTCTGAAAGGCTAT 
CCTGCTGAAAGTCCTGCTTTCCTATCTAGC^ 

AAGTAAACTTGTGTATTGAGTCTTAACTGTATTTCAGTATTTTCCAGCCTTATGTGTTACATTATTCCAA 
TGATACCCAACAGTTTATTTTTATTATTTTTTTTy^CAAAATTTCACAGTTCTGTAATGTAGGCACTTTT 
ATTTTCATTGTGATTTATATATAAGGTAATGTAGGGTTATATTTGGGAGTGAOTGCAAGCATTTTTCCAT 
CTGTGTGCAACTAACTGACTCTGTTATTGATCCCTTCTCCTGCCCTTTCCCAGGTAATTTAAATTGGTCA 
TGGTAGATTTTTTTCATAGATTTGAAAAACTTTTAGGTTGTTACCAAGTATGAAGTATAAATCTGGGGAA 
GAGGTTTTATTTACATTTTAGGGTGGGTAAGAAAGCCACCTTGTTACAAATTTTTTAATTTCCAAAATAA 
TCTATATTAAATGAGGGTTTCTGATCTGTACTTTGTGTTTAGCTACCTTTTTATATTTAAAAAATTAAAA 
ATGAAAATTATGTTCTTACAAGCTTAAAGCTTGATTTGAT 

GAGTTACTTAGAATGCCATAAAATTGCAGTTTCATGTATGTATATAATCATGCTCATGTATATTTAGTTA 
CGTATAATGCTTTCTGAGTGAGTTTTACTCTTAAATCATTTGGTTAAATCATTTGGCTTGCTGTTTACTC 
CCTTCTGTAGTTTTTAATTAAAAGCTTTAAAGAT^ 

ATCTTTGTGTAATCTAAGTATATGTGAGAAATCAGAATTGGCATAATTTGTCTTAGTTGATATTCAAGGC 
TTTAAAAGTCATTATTCCTGGGCTTGGTAAGTGAATTTATGAGATTTACTGCTCTAGAAAGTATAGATGG 
CCAAAGGACCGTTTTGTATTGCTTCCTGATTACCAGTCTGATTATACCATGTGTGCTAATATACTTTTTT 
TGTTATAGATTGTCTTAATGGTAGGTCAAGTAATAAAAAGAGATGAAATAATTT 

Human PRKAR1 A mRNA sequence - var2 (public gi: 4 8 84 2 7 9) (SEQ ID NO: 1 12) 

TATTTTCCAGCCTTATGTGTTACATTATTC 

AAAATTTCACAGTTCTGTAATGTAGGCACTTTTATTTTCATTGTGATTTATATATAAGGTAATGTAGGGT 
TATATTTGGGAGTGACTGCAAGGATTTTTCCAT^ 

TCCTGCCCTTTCCCAGGTAATTTAAATTGGTCATGGTAGATTTTTTTCATAGATTTGAAAAACTTTTAGG 

TTGTTACCAAGTATGAAGTATAAATCTGGGGAAGAGGTTTTATTTACATTTTAGGGT 

ACCTTGTTACAAATTTTTTAATTTCCAAAA 

TTTAGCTACCTTTTTATATTTAAAAAATTAAAAATGAAAATTACGTTCTTACAAGCTTAAAGCT 
GATCTTTGTTTAAATGCCAAAATGITACTTAAATGAGTT^ 

ATGTATATAATCATGCTCATGTATATTTAGTTACK5TATAATGCTTTCTGAGTGAGTTTTACTCTTAAATC 
ATTTGGTTAAATCATTTGGCTTGCTGTTTACTCCCTTCTGTAGTTTTTAATTAAAAACTTTAAAGATAAG 
TCTACATTAAACT^TGATGACATCTAAAGCT 
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TTGGCATAATTTGTCTTAGTTGATATTCAAGGCTTTAAAAGTCATTATTCOT 
TATGAGATTTACTGCTCTAGAAAGTATAGATGGCCAAAGGAC^ 
. CTGATTATACCATGTGTGCTAATATACTTT^ 
AAGAGATGAAATAATTTAAAAAAAAAAAAA 

Human PRKAR1 A roRNA sequence - var3 (public gi: 3 3 63 672 o) (SEQ ID NO: 1 13) 

GGTGGAGCTGTCGCCTAGCCGCTATCGCAGAGTGGAGCGGGGOT 
TACGCCGCCGCCTCGCACCCGCAGCCTCGCGCCTOC^ 
AGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATGTGAGCTCTA 
AAGCGCTGCTGAAAGATTCTATTGTCCAGTTC 

GGAATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAACAGATTCAGAATCTGCAGAA 

ACAGACTCAAGGGAGGATGAGATTTCTCCTCCTCCAC^ 

GTGCTATCAGCGCTGAGGTCTAGACGGAGGAAGATGCGGC^ 

TTACAAGACAATGGCCGCTTTAGCCAAAGCCATTGAAAAGAATGTGCre 

GAGAGAAGTGATATTTTTGATGCCATGTTTTCGGTCTCCTTTATCGCAGGAG^ 

GTGATGAAGGGGATAACTTCTATGTGATTGATCAAGGAGAGACGGATGTCT 

AACCAGTGTTGGGGAAGGAGGGAGCTTTGGAGAACrTGCTTTGAT^ 

GTGAAAGCAAAGACAAATGTGAAATTGTGGGGCATCGACC 

GCAGACTGAGAAAGCGGAAGATGTATGAGGAATTCC 

. GTGGGAACGT CTTACGGTAG CTGATGCATTGGAACCAGTG CAGTTTGAAGATGGGCAGAAGATTGTGGTG 
(^GGGAGAACCAGGGGATGAGTTCTTCATTATTTTAGAGGGGTCAGCTGCrrGT 

AAAATGAAGAGTTTGTTGAAGTGGGAAGATTGGGGCCTTCTGATTATTTTGGTGAAATTGCACTACTGAT 
GAATCGTCCTCGTGCTGCCACAGTTGTTGCTOTTGGCCCCTTGAAGTGCGTTAAGCTGG 

tttgaacgtgttcttggcccatgctgagacatcctca} 

cactgtctgtctgaaatctgcctcctgtgcctcccttttctcctctccccaatccatgct 
aaactgctttattttccctacttc^ 

attgaaagttg cttttattgcac cattttcaatttggagcattaactaaatgctcatacacagttaaata 

aatagaaagagttctatggagactttgctgttactgcttctc 

c^gtgagtgccatgctttttggtgagggcagatccc^^ 

acagtgcaagattttttttttaagtgacataattgtccagttataagcgtatttagactgtggccatata 
tgctgtatrtctttgtagaataaatggtt^ 

atcttagtatagtagaaagacatctgcctgtaattaaactagtttaagggtggaaaaatgcccatttttg 

CTAATTATCAATGGGATATGATTGGTTCAGTTT^ 

CCTGGTTTTATTTATATCTTGTTATTAATGTTTCTTCTCCAATTCTGAAATACTTTTGAGTATGGCTATC 
TATACCTGCCTTTTAAGTTTGAAACTAACTCATAGATTGCAAATATTGGTTAGTATTTAACTACATCTGC 
CTCGGCTCACAAATTCCGATTAGACCTTTATCCAGCTAGTGCGAAATAATTGATCAGATGCTGAATTGAG 
AATAAGAATTTGAGGTCTACATTCTTGGTTGTTAATTTAGAGCGTTTGGTTAAAGTATGTCCTTCAGCTG 
ACTCGAGTATAATCTCCTCTGCTCATTAAACT 

TGGAGCAAA.TGTCCTAACAGAGAAATAGAGGTGATGCTGCTAAAGGGAGAAATGCCAGGCGGACAAAG . 
CAGTGTCGGGAATTTTCCCCGTGACATTCACTGGGGCATGAGATTT^ 

TAGTCTTTTTTTCCTTCCTTTTTATTCAGCTAGAATTTCTGGTGGGTTGATGGTAGGGTATAATGTGTCT 

GTGTTGCTTCAAATTGGTCTGAAAGGCTATCCTGCGGAAAGTCCTGCTTTCCTATCTAGCATTTATTTCT 

CTGGCAAACTTTTCTTTCTTTTCTTTTTT^^ 

TTTCCAGCCTTATGTGTTACATTATTC 

ATTTCACAGTTCTGTAATGTAGGCACTTTTATTTTCATTGTGATTTATATATAAGGTAATGTAGGGTTAT . 
ATTTGGGAGTGACTGCAAGC^TTTTTCGATCT ' 
* TGCCCTTTCCCAGGTAATTTAAATTGK3TCATGGTA 
TTACCAAGTATGAAGTATAAATCTGGGGAAGAGGTTTT^^ 

TTGTTACA^TTTTTTAATTTCCAAAATAATOTATATTAAATGAGGGTTTCTGATCTGTACTTTGTGTTT 
AGCTACCTTTTTATATTTAAAAAATTAAAAATGAAAATTACGTTCTTACAAGCTTAAA 
C^TGTTTAAATGCCA&AATGTACTTAAATGAGTTACT^ 
TATATAATCATGCTCATGTATATTTAGT^^ 

TGGTTAAATGATTTGGCTTGCTGTTTACTCCCTTCTGTAGTTTTTAATTAAAAACTTTAAAGATAAGTCT 

ACATTAAACAATGATCACATCTAAAGCTTTATCTTTGTGTAATCTAAGTATATGTGAGAAATCAGAATTG 

GCATAATTTGTCTTAGOTGATATTCAAGGCTTTAAAAGTCATTATTCCTGGGCTTGGTAAGTGAATTTAT 

GAGAT^ACTGCTCTAGAAAGTATAGATGGCGAAAGGACCGTTTTGTATTGCTTCCTGATTACC^ 

ATTATACCATGTGTGCTAATATACTTTTTTTGTTATAGATTGTCTTAATGGTAGGT(^AGTAATAAAAAG 

AGATGAAATAATTTAAAAAAAAAAAAAAAA 

Human PRKAR1 A mRNA sequence - var4 (public gi: 1526989) (SEQ ID NO: 114) 

GCTGGGAGCAAAGCGCTGAGGGAGCTCGGTACGCCGCCGCCTCGCACCCGCAGCCTCGCGCCCGCCGCCG 
CCCGTCCCCAGAGAACCATGGAGTCTGGCAGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATG 
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TGAGCTCTACGTCCAGA^GCATA^ 

CGACCTGAGAGACCCATGGCATTCOTCAGGGAATACTTTGAG^ 
TTCAGAATCTGCAGAAAGCAGGCACTCGTACAGACT 

cccagtggttaaaggtaggaggcgacgaggtgctatcagc^ 

tcctatgttagaaaggttataccaaaagattagaagacaatc 

atgtgctgttttcac^tcttgatgataatgagagaagtgatattct 

tatcgcaggagagactgtgattcaggaaggtgatgaaggggataacttctatc 

acggatgtctatgttaacaatgaatgggcaaccagtgttggggaaggag 

TGATTTATGGAAC^CCGAGAGCAGCCACTCTCAAAGCAAAG^ 

AGACAGCTATAGAAGAATCCTC^TGGGAAGCACACTGAGAAAGCGGAAGATGTATGAGGAATTCCTTAGT 
AAAGTCTCTATTTTAGAGTCTCTGGACAAGTGGGAACGTCTTAC^ 

AGTTTGAAGATGGG CAGAAGATTGTGGTGCAGGGAGAAC CAGGGGATGAGTTCrTCATTATTTTAGAGGG 

GTCAGCTGCTGTGCTACAACGTCGGTGAGAAAATGAAGAGTTTGTTGAAGTGQGAAGAT 

GATTATTTTGGTGAAATTGCACTACTGATGAATCGT^ 

TGAAGTGCGTTAAGCTGGACCGACCTAGATTTGAACGTGTTCTOGGCCCATGCTCAGACATCCTCTWVTO 
AAAC^TCCAGCAGTACAACAGTTTTGTGTCAC 

CCTCTCCCCAATCCATGCTTCACTCATGCAAACTGCrTTATTTTC^ 
TGGCATCGCAGCTTCCTGTCTGTTTATATATTGAAAGTT 

TTAACTAAATGCTCATA(^CAGTTAAATAAATAGAAAGAGTTCTATGGAGACTTTGCTGTTACTGCTT 

CTTTGTGCAGTGTTAGTATTCACCCTGGGCAGTGAGTGCCATGCTTTTTOT 

TATTGAATTACCATAGAGTAATGATGTAAC^GTGCAAGATTTTTTTTTTTAAGTGACAT 

TAT7VAGCGTATTTAGACTGTGGCCATATATGCTGTATTTCTTTGTAGAATAAATGGTTTCT 

CTAAAGATTAGGGAAATGGATATAGAAAATCTTAGTATAGTAGAAAGAC^TCTGCCTGTAATTAAACTAG 

TTTAAGGGTGGAAAAATCAAAATTTTTGCTAATTAT 

AGAGTTGTTGTTTGCCAAGCTAATCTGCCTGGTTTATTTATATC 

CTGAAATACTTTTGAGTATGGCTATCTATACCTGCCTTTTAAGTTTGAAACTAACTCATAGATGCAAATA 
TTGGTTAGTATTTAAOTACATCTGCCTCGGCTCACAAATTCCGATTAGACCTTTATCCAGCTAGTGCCAA 
ATAATTGATCAGATGCTGAATTGAGAATAAGAATTTGAGGTCT^ 

TTGGTTAAAGTATGTCCTTCAGCTGACTCC^GTATAATCTCCTCTGCTCATTAAACTGATTCCAGGAGAT 

TGGATTTGCTGTGACTAGATACAGATGGAGCAAATGTCCTAACAGAGAAATAGAGGTGATGCTGCTAAAG 

GGAGAAATGCCAGGCGGACAAAGTTCAGTGTCGGGAATTTTCCCCGTGACATTCACTGGGGCATGAGAT^ 

TTGGAAGAAGTTT.TTTACITTGGTTTAGTCTTTTTTTCCTCCTTT 

TTGATGGTAGGGTATAATGTGTCTGTGTTGCTTGAAATTGGTCTGAAAGGCTATCCT 

CTTTCCTATCTAGCATTTATTCCTCTGGCAAA.CTTTTCTTTCTTCT 

TGAGTCXTAACTGTATTTCAGTATTTTCCAGCCTTATGTGTTACATTATTCCAATGATACCC^CAGTOT 

ATTTTTATTATTTTTTTAAACAAAATTTCACAGT 
ATATATAAGGTAATGTAGGGTTATATTTGGGAGTGACTGCAAG<^ 

GACTCTGTTATTGATCCCTTCTCCTGCCCTTTCCCAGGTAATTTAAATTGGTCATGGTAGATTTTTTTCA 
TAGATTTGAAAAACTTTTAGGTTGTTACCAAGTATGAAGTATAAATCTGGGGAAGAGGTTTTATTTACAT 
TTTAGGGTGGGTAAGAAAGCCACOTTGTTACAAATTTTTTAATTTCCAAAATAATCTATATTAAATGAGG 
GTTTCTGATCTGTACTTTGTGTTTAGCTACCTTTTTATATTTAAAAAATTAAAAATGAAAATTATGTTCT 
TACAAGCTTAAAGCTTGATTTGATCT 

Human PRKAR1 A mRNA sequence - var5 (public gi: 1526988) (SEQ ID NO: 115) 

GGCAGAGTGGAGCGGGGCTGGGAGCAAAGCGCTGAGGGAGCTCGGTACGCCGCCGCCTCGCACCCGCAGC 
CTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCATGGAGTCTGGCAGTACCGCCX3CCAGTGAGGAGGCAC 
GCAGCCTTCGAGAATGTGAGCTCTACGTCCAGAAGCATAACATT 

GCAGTTGTGCACTGCTCGACCTGAGAGACCCATGGCATTCCT^GGGAATACTTTGAGAGGTTGGAGAAG 
GAGGAGGCAAAACAGATTC^GAATCTGCAGAAAGCAGGCACTCGTACAGACTCAAGGGAGGATGAGATTT 
CTCCTCCTCCACCCAACCCAGTGGTTAAAGGTAGGAGGCGACGAGGTGCTATCAGCGCTGAGGTCTACAC 
GGAGGAAGATGCGGCATCCTATGTTAGAMGGTTATACCAAAAGATTACAAGACAATGGCCGCTTTAGCC 
AJ^GCCATTGAAAAGAATGTGCTGTTTTGAGATCT^ 

TGTTTTCGGTCTCCTTTATCGCAGGAGAGACTGTGATTCAGCAAGGTGATGAAGGGGATAACIT 

GATTGATCAAGGAGAGACGGATGTCTATGTTAACAATGAATGGGCAACCAGTGTTGGGGAAGGAGGGAGC 

TTTGGAGAACTTGCTTTGATTTATGGAACACCGAGAGCAGCCACTGTCAAAGCAAAGACAAATGTGAAA 

TGTGGGGCATCGACCGAGACAGCTATAGAAGAATCCTCATGGGAAGCACACTGAGAAAGCGGAAGATGTA 

TGAGGAATTCCTTAGTA7UVGTCTCTAXTTTAGAGTCTCTGGACAAGTGGGAACGTCTTACGGTAGCTGAT . 

GCATTGGAACCAGTGCAGTTTGAAGATGGGCAGAAGATTGTGGTGCAGGGAGAACCAGGGGATGAGTTCT 

TCATTATTTTAGAGGGGTCAGCTGCTGTGCTACAACGTCGGTCAGAAAATGAAGAGTTTGTT 

AAGATTGGGGCCTTCTGATTATTTTGGTGAAATTGCACTACTGATGAATCGTCCTCGTGCTGCCACAGTT 

•GTTGCTCGTGGCCCCTTGAAGTGCGTTAAGCTGGACCGACCTAGATTTGAAGGTGTTCTTGGCCCATGOT 

GAGACATCCTCAAACGAAACATCCAGCAGTACAAC^ 

TGTGCCTCCCTTTTCTCCTCTCCCCAATCCATGCTTCACTCATGCAAACTGCTTTATTTTCCCTACTTGC 
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AGCGCGAAGTGGCCACTGGCATCXXIAGCT 

TTTCAATTTGGAGCATTAACTAAATGCTCATACACA 

AAAAAA 

Human PRKAR1 A mRNA sequence - var6 (public gi: 9956010) (SEQ ID NO: 1 16) 

AACTGACTCTX3TTATTGATCCCTTCT 

TTCATAGATTTGAAAAACTTTTAGGTTGTTAC 

ACATTTTAGGGTGGGTAAGAAAGCCACCTTGTT 

GAGGGTTTCTGATCTGTACTTTGTGTTTAGCTACCTTTTTATATTO 

TTCTTAOU^GCTTAAAGCTTGATTTGATCTTTGTOT 

ATGCCATAAAATTGCAGTTTGATGTATGTATATAAT^ 

TCTGAGTGAGTTTTACTCTTAAATC^TTTGGTTAAATCATTTGGCTTGCTG 

TTTAATTAAAAACTTTAAAGATAAGTCTACAT^ * 
TCTAAGTATATGTGAGAAATCAGAATTGGCATAATTTGTC 

TATT C CTTG GGCTTGGTAAGTGAATTTATGAG ATTTACTGCT CT AGAAAGTATAGATGGC CAAAGGACCGT 
TTTGTATTGCTTCCTGATTACC^GTCTGATTATACCATGTGTGCTAATATACT 
TCTTAATGGTAGGTCAAGTAATAAAAAGAGATGAAATAATTTAAATTCra 
CCTTTCTCCTTTCCGTCTTTCCT 

GGGGGAAAGTGAATTATGGGATCGGTGTTTTGAAAGAGCAATGTTTATTTTCAGTGCTTTTC^ 
AAAGAGTGGATCTCAAAATCTTGCTTAAAGGGTAATTC^ 
AAGAAAAAAATTCAGTCAAAAGCTAAAGATTTCCTTTTGATT 
ACTTTCCCAGACTTAATGGGGAAACATGATTTCT 

CATTTAATGTTACTTAGGGATACTTTTATATTTTGCATATATAAAGCCTCATAT^ 

GATGCTCTTAGATTTCTGAGGAGTGAGATGATTAAGTO 

CCAGTGAAATTGGAGATATGTTGTATGTTAGAAGAGCACT^ 

CCITTTCTAGATTCAGTAATCCCTTCCCCCCGTC 

GTAACTAAAGAAAAAAAAAAAAAAAAAAAAAAAAA 

Human PRICAR1 A mRNA sequence - var7 (pubHc gi: 21757J5S) (SEQ'ID NO: 117) 

TAATTTTCTTGTGTGTTTTTAAAAATTTTG . 

GTTTGCTCTGTGACGTTAGGATACTCCCATGGGATAGAAGTTACGTATAGGGAATGTCAGATATTCTTCA 

TTGTGCTGACTTGCTTTCGCTTACAGTTGACTTTTGTGCCOTGGTAATT 

CCTACTTCCCACGTGATCATGATTTCTT^ 

AGCACCCGTCTCTGC^GAGGTTAGTGGCTCATACTTCCrCCCAGGAGCTGAGGTTATCGACTCTCA 

TGCCTACAGAGCACAGATCCTGAACTAAATGAAACATTTACTTGGAATAATGCTA 

TATTCCCTAGTCCCCACTTCCCTGTTTAAAAAGAAAAT^ 

TCTAATGAATTTAGCAAGTTAAATGCCAGATTGACATTXTGCTTTATAGTTTATACAAGCATGTGTGTGT 

TTTTTTCTCGCAGAGAACCTVTGGAGTCTGGCAGTACCGCC^ 

TGTGAGCTCTACGTCCAGAAGGATAACATTCAAGCGCTGCTCAAAG^ 

CTCGACCTGAGAGACCCATGGCATTCCTCAGGGAATAOTTTGAGAGGTTGGAGAAGGAGGAGGCAAAACA 
GATTCAGAATCTGCAGAAAGCTIGGC^CTCGTACAGACTCAAGGGAGGATGAGATTTCTCCTCCTCCACCC 
AACCCAGTGGTTAAAGGTAGGAGGCGACGAGGTGCTATCAGCGCTGAGGTCTACACGGAGGAAGATGCGG 
CATCCTATGTTAGAAAGGTTATACCAAAAGATTACAAGACAATGGCCGCTTTAGCCAAAGCCATTGAAAA 
GAATGTGCTGTTTTCACATCTTGATGATAATGAGAGAAG 

TTTATCGCAGGAGAGACTGTGATTCAGCAAGGTGATGAAGGGGATAACTTCTATGTGATTGATCAAGGAG 
AGACGGATGTCTATGTTAACAATGAATGGGCAACCAGTGTTGGGGAAGGAGGGAGCTTTGGAGAACTTGC 
TTTGATTTATGGAACACCGAGAGCAGCCACTGTCAAAGCAAAGAC^ 

CGAGACAGCTATAGAAGAATCCTCATGGGAAG<^CACTGAGAAAGCGGAAGATGTATGAGGAATTCCTTA 

GTAAAGTCTCTATTTTAGAGTCTCTGGACAAGTGGGAACGTCTTACGGTAGCTGATGCATTGGAACCAGT 

GCAGTTTGAAGATGGGCAGAAGATTGTGGTGCAGGGAGAACCAGGGGATGAGT^ 

GGGTCAGCTGCTGTGCTACAACGTCGGTCAGAAAATGAAGAGTTTGTT^ 

CTGATTATTTTGGTGAAATTGCACT^ 

CTTGAAGTGCGTTAAGCTGGACCGACCTAGATTTGAACGTGTTCTTGGCCCATGCTCAGACATCCT 

CGAAACATCCAGCAGTACAAC^GTTTTGTGTCACTGTCTGTCTGAAATCCGCCTCCTGTGCCTCCCTTTT 

CTCCTCTCCCCAATC<^TGCTTCyvCTCATGG?^CTGCTTTATTTTC 

ACTGGCATCGCAGCTTCCTGTCTGTTTATATATTGAAAGTTGCTTTTATTGCACCATTTTCAATTTGGAG 
CATTAACTAAATGCTCATACACAGTTAAATAAATAGAAAGAGTTCTATGG 

Human PRKAR1 A mRNA sequence - var8 (public gi: 16583 05) (SEQ ID NO: 118) 

AGAGGCGTCAAGGGAGGCCGGAGGGAGAGTGGGGTGGACAGAGGAGCGGAGGGACGAGAGGGAAGCGCAC 
GATAGCTGCGCGGAGAGAGAGCGAAGAGCAGGAGGAGGAACAAAGGCGACCCAAGACACCCAGAGAGGGA 
CAGAGAACCATGGAGTCTGGCAGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATGTGAGCTCT 
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ATCTCCAGAAGCATAACATTCAAGCGC 

GAGACCCATGGCATTCCTCAGGGAATACTTTGAGAGGT^^ 

CTGCAGAAAGGAGGGA£^CGTACAGACTCAAGG<^ 

Human PRKAR1A protein sequence - varl (public gi: 4506063) (SEQ ID NO: 264) 

MESGSTAASEEARSLRECELYVQKHNIQAIiIjKDS 

AGTRTDSREDEI SPPPPNPWKGRRRRGAI SAEVYTEEDAAS YVRKVI PKDYKTMAAI1AKAIEKNVX1FSH 
LDDNERSDI FDAMFS VS FI AGETVIQQGDEGDNFYVIDQGETDVYVNNEWATSVGEGGSFGEIiALI YGTP 
RAATVKAKTNVKXjWGIDRDSYRRILMGSTI^^ 
KIWQGEPGDEFFIILEGSAAVIjQRRSENEEFVEVGRLGPSOT 
DRPRFERVLGPCSDILKRNIQQYNSFVSIiSV 

Human PRKAR1A protein sequence - var2 (public gi: 1658306) (SEQ ID NO: 265) 

I^SGSTAASEEARSLRECEIjYVQKHNIQAIjL 
AGTRTDSREDEISPPPP 

Human PRKAR1 A pray sequence - varl (SEQ ID NO: 119) 

GCCGCCTGGTNTACCCATACGACGTACCAGTAT^ 

CCCAAGCAGTGGCTATCAACGCAGAGTGGTAGCGGGGCATGGGAGCAAAGCAGCATGAGGGAGCTCGGTA 
raCCGCCGCCTCNCACCCGCAGCCTCGCGCCCGCCGCCGCCCGTCCCCAGNGAACCATGGAGTCTGGCAG 
TACCGTTTCCAGTGAGGAGGCACTNCAGCCTTCGAGAATGTGAGCTCrNNGTCCAGAAGCATNACATTCA^ 
* TGCGCTNCTCAAAGATTCTNTTGTGCANTTGT 

GGAATACTTGGCGNACGNNGNNTAATGANGAGGCCCNNTNTNTTN CAAANTCTNCANNTNTTTNNNTCTT 

TNACAAA(nTTTTGGAOTATNANNNANCCOmJNNANANAAANA 

NCCC 

Human PRKAR1 A pray sequence - var2 (SEQ ID NO: 120) 

GAGCGCCGCC^TGGNANTACCCATACGACGTACCAGNATTACGCTCATATGGCCATGGAGGCCAGTGAAT 
TCCACCCAAGC^GTGGTATCAACGCAGAGTGGTAGCGGGGCT^ 

CGCCGCCGCCTCGCACCCGCAGCCTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCATGGAGTCrGGCAG 
TACCGCCGCCAGTGAGGAGG<^CGC^GCCTTCGAGAATGTGAGCTCTACGTCCAGAAGCATAACATTC^ 
GCGCTGCTCAAAGATTCTATTGTGCAGTTGTGCACTGCTCGACCTGAGAGACCCA 

AATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAAC^GATTCAGAATCTGCAGAAAGC^^CACTCGTAC 
AGACTC^UVGGGAGGATGAGATTTCTCCTCCTCCACCCAACCCAGTGGTTAAAGGTAGGAGGCGACGAGGT 
GCTATCAGCGCTGAGGTCTACACGGAGGAAGATGCGGCATCCTATGTTAGAAAGGTTATACCAAAAGATT 
ACAAGACGATGGCCGCTTTAGCCAAAGCCATTGAAAAGAATGTGCTG 

GAGAAGTGATATTTTTGATGCCATGTTTTCGGTCTCCTTTATCGCAGGAGAGACTGTGATTCANCAAGGT 

GATGAAGGGGATAACTTCTATGTGATTGATCAAGGAKTAGACNGATGTCTATGTTAACAATGAATGGGCNA 

CCANTGTTGGGGAAGGAGGGAGCTTTGGAAAACTTGCT^ 

AAACCAAAACAAA 

Human PRKAR1 A pray sequence - var3 (SEQ ID NO: 121) 

CGACGCCGCCTGGTATACCCATACGACGTACCAGTATTACGCTCATATGGCCATGGCAGGCCAGTGAATT 

CCACCCAAGCAGGTGCGATATGCATACGCGAGNAGTGAGTAACGGCGGCTGGGTAGCGAAGTCGCTGAGG 

GAGCTCGGTACNCCGCCAGCGCTCGCACCCGCANCCTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCAT 

GGAGTCTGGCAGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATGTGAGCTCTACGTCCAGAAG ' 

CATAACATTCAAGCGCTGCTCAAAGATTOTATTGTGCA.GTTGTGCACTGC 

CATTCCTCAGGGAATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAACAGATTCAGAA^ 

AGGCACTCGTAC7VGACTCAAGGGAGGATGAGATTTCTC 

AGGCGACGAGGTGCTATC^GCGCTGAGGTCT^^ 

TTTTTGATATTTGAATATCGGGGGGGATGOTTTNGGGACCCACTTGGTGGTCATCTANTCTCCOT 
ANTGATTCTTAAATCGAAAACNGGGNGGAACTTGATCNNNCT 
TTTTTNTAATACCNNCTTNNCAANGAAANANCNNTTNGNGTTTTNi^ 
GTTNNNNTTTCCNTCNN^ 

Human PRKAR1 A pray sequence - var4 (SEQ ID NO: 122) 

CGTANCNNCGCGNGACTCGGTGACTGANGCGATGATCGC^ 
GGNTCAGTGTGCAGGGCCATGTTGANNTCTCOT 

AGAGANATATTTTTANTACTNACTCACTATAGGGCGAGCGCCGCCATGGAGTACCCATAOT 

ATTACGCTCATATGGCCATGGAGGCCAGTGAATTCCACCCAAGCAGTGGTATCAACGCAGAGTGGAGCGG 

GGCTGGGAGCAAAGOSCri^GGGAGCTCGGTACGC^ 
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GCCCGTCCCCAGAGAACCATCKSAGTCTGGCNGT^ 

GTGAGCTCTACGTCCAGAAGCATAACATNNGNGCGCTGCTCAAAGATT^ 
TCGACCTGAGAGACCC^TGGCATTCCTCAGGGA^ 

NATTCANAATCreCNGAAAGCANNAOTCOTACAGACTCAGGGGNGGN^ 

NCCNAtmSGTTAAGGGTNGGAGGCNAC^GGNCTNTTKWCCC^ 

ATTCCTATCTTAAAANGGGTNTTTCCNNTANNNA^ 

TOJAAAAAAANGNGOSOTTTCCNANTNTbmGNGAA 

TNNCGTTNTCTTTTTCNGGNGGAAACNTTNATTAANNCCG 

■Unigene Name: PRKARIA Unigene ID ; Hs . 183037 Clone ID : 3GD_188 
Human PJRKAJRIA mRNA sequence - varl (public gi: 23273779) (SEQ ID NO: 396) 

GGTGGAGCTGTCGCCTAGCCGCTATCGCAGAGTGGAGCGG^CTGGGAGCAAAGCGCTGAGGGAGCTCGG 

TACGCCGCCGCCTCGCACCCGCAGCCTCGCGCCCGCCGCCGCCCGTCCCCAGAGAACCATGGAGTCTGGC 

AGTACCGCCGCCAGTGAGGAGGCACGCAGCCTTCGAGAATGTGAGCTCTACGTCCAGAAGCATAACATTC 

AAGCGCTGCTCAAAGATTCTATTGTGCAGTTGTGCACTGCTCGACCTGAGAGACCCATGGCATTCCTCAG 

GGAATACTTTGAGAGGTTGGAGAAGGAGGAGGCAAAACAGATTCAGAATCTGCAGAAAGCAGGCACTCGT 

ACAGACTCAAGGGAGGATGAGATTTCTCCTCCTCCACCCAACCCAGTGGTTAAAGGTAGGAGGCGACGAG 

GTGCTATCAGCGCTGAGGTCTACACGGAGGAAGATGCGGCATCCTATGTTAGAAAGGTTATACCAAAAGA 

TTACAAGACAATGGCCGCTTTAGCCAAAGCCATTGAAAAGAATGTGCTGTTTTCACATCTTGATGATAAT 

GAGAGAAGTGATATTTTTGATGCCATGTTTTCGGTCTCCTTTATCGCAGGAGAGACTGTGATTCAGCAAG 

GTGATGAAGGGGATAACTTCTATGTGATTGATCAAGGAGAGACGGATGTCTATGTTAACAATGAATGGGC 

AACCAGTGTTGGGGAAGGAGGGAGCTTTGGAGAACTTGCTTTGATTTATGGAACACCGAGAGCAGCCACT 

GTCAAAGCAAAGACAAATGTGAAATTGTGGGGCATCGACCGAGACAGCTATAGAAGAATCCTCATGGGAA 

GCACACTGAGAAAGCGGAAGATGTATGAGGAATTCCTTAGTAAAGTCTCTATTTTAGAGTCTCTGGACAA 

GTGGGAACGTCTTACGGTAGCTGATGCATTGGAACCAGTGCAGTTTGAAGATGGGCAGAAGATTGTGGTG 

CAGGGAGAACCAGGGGATGAGTTCTTCATTATTTTAGAGGGGTCAGCTGCTGTGCTACAACGTCGGTCAG 

AAAATGAAGAGTTTGTTGAAGTGGGAAGATTGGGGCCTTCTGATTATTTTGGTGAAATTGCACTACTGAT 

GAATCGTCCTCGTGCTGCCACAGTTGTTGCTCGTGGCCCCTTGAAGTGCGTTAAGCTGGACCGACCTAGA 

TTTGAACGTGTTCTTGGCCCATGCTCAGACATCCTCAAACGAAACATCCAGCAGTACAACAGTTTTGTGT 

CACTGTCTGTCTGAAATCTGCCTCCTGTGCCTCCCTTTTCTCCTCTCCCCAATCCATGCTTCACTCATGC 

AAACTGCTTTATTTTCCCTACTTGCAGCGCCAAGTGGCCACTGGCATCGCAGCTTCCTGTCTGTTTATAT 

ATTGAAAGTTGCTTTTATTGCACCATTTTCAATTTGGAGCATTAACTAAATGCTCATACACAGTTAAATA 

AATAGAAAGAGTTCTATGGAGACTTTGCTGTTACTGCTTCTCTTTGTGCAGTGTTAGTATTCACCCTGGG 

CAGTGAGTGCCATGCTTTTTGGTGAGGGCAGATCCCAGCACCTATTGAATTACCATAGAGTAATGATGTA 



TGCTGTATTTCTTTGTAGAATAAATGGTTTCTCATTAAACTCTAAAGATTAGGGAAAATGGATATAGAAA 

ATCTTAGTATAGTAGAAAGACATCTGCCTGTAATTAAACTAGTTTAAGGGTGGAAAAATGCCCATTTTTG 

CTAATTATCAATGGGATATGATTGGTTCAGTTTTTTTTTTTCCAGAGTTGTTGTTTGCCAAGCTAATCTG 

CCTGGTTTTATTTATATCTTGTTATTAATGTTTCTTCTCCAATTCTGAAATACTTTTGAGTATGGCTATC 

TATACCTGCCTTTTAAGTTTGAAACTAACTCATAGATTGCAAATATTGGTTAGTATTTAACTACATCTGC 

CTCGGCTCACAAATTCCGATTAGACCTTTATCCAGCTAGTGCCAAATAATTGATCAGATGCTGAATTGAG 

AATAAGAATTTGAGGTCTACATTCTTGGTTGTTAATTTAGAGCGTTTGGTTAAAGTATGTCCTTCAGCTG 

ACTCCAGTATAATCTCCTCTGCTCATTAAACTGATTCCAGGAGATTGGATTTGCTGTGACTAGATACAGA 

TGGAGCAAATGTCCTAACAGAGAAATAGAGGTGATGCTGCTAAAGGGAGAAATGCCAGGCGGACAAAGTT 

CAGTGTCGGGAATTTTCCCCGTGACATTCACTGGGGCATGAGATTTTGGAAGAAGTTTTTTACTTTGGTT 

TAGTCTTTTTTTCCTTCCTTTTTATTCAGCTAGAATTTCTGGTGGGTTGATGGTAGGGTATAATGTGTCT 

GTGTTGCTTCAAATTGGTCTGAAAGGCTATCCTGCGGAAAGTCCTGCTTTCCTATCTAGC^TTTATTTCT 

CTGGX^AAACTTTTCTTTCTTTTCTTTTTTAAAGTAAACTTGTGTATTGAGTCTTAACTGTATTTCAGTAT 
TTTCCAGCCTTATGTGTTACATTATTCCAATGA 

ATTTCACAGTTCTGTAATGTAGGCACTTTTATTTTCATTGTGATTTATATATAAGGTAATGTAGGGTTAT 

ATTTGGGAGTGACTGCAAGCATTTTTCCATCTGTGTGCAACTAACTGACTCTGTTATTGATCCCTTCTCC 

TGCCCTTTCCCAGGTAATTTAAATTGGTCATGGTAGATTTTTTTCATAGATTTGAAAAACTTTTAGGT 

TTACCAAGTATGAAGTATAAATCTGGGGAAGAGGTTTTATTTACATTTTAGGGTGGGTAAGAAAGCCACC 

TTGTTACAAATTTTTTAATTTCCAAAATAATCTATATTAAATGAGGGTTTCTGATCTGTACTTTGTGTTT 

AGCTACCTTTTTATATTTAAAAAATTAAAAATGAAAATTACGTTCTTACAAGCTTAAAGCTTGATTTGAT 

CTTTGTTTAAATGCCAAAATGTACTTAAATGAGTTACTTAGAATGCCATAAAATTGCAGTTTCATGTATG 
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TATATAATCATGCTCATGTATAT^ 
TCGTTAAATCATTTGGCTTGCTGTTTACTCCCTTC 

ACATTAAACAATGATCACATCTAAAGCTTTATCTTTGTGTAATCTAAGTATATGTG 
GCATAATTTGTCTTAGTTGATATTCAAGGCTTTAAAAGTCATTA 

G AGATTT ACTGCTCT AGAAAGTATAGATGGCG AAAGGAC CGTTTTGTATTG CTTC CTGATTACCAGTCTG 

ATTATACCATGTGTGCTAATATACTTTTTTTGTTATAGATTC 

AGATGAAATAATTTAAAAAAAAAAAAAAAA 

Human PRKARIA roRNA sequence - (public gi: 4506062) (SEQ ID NO: 397) 

GCTGGGAGCAAAGCGCTGAGGGAGCTCGGTACGCCGCCGCCTCGCACCCGCAGCCTCGCGCCCGCCGCCG 

CCCGTCCCCAGAGAACC^TGGAGTCTGGCAGTACCGCCGCC^GTGAGGAGGCACGCAGCCTTCGAGAATG 

TGAGCTCTACGTCCAGAAGCATAACATTCAAGCGCTGCTCAAAGATTCTATTGTGCAGTTGTGC^ 

CGACCTGAGAGACCGATGGCATTCCTCAGGGAATACTTTGAGAGGTTGGAGAAGG^ 

TTC^GAATCTGCAGAAAGCAGGCACTCGTACAGACTCAAGGGAGGATGA 

CCC^TGGTTAAAGGTAGGAGGCGACGAGGTGCTATC^GCGCTGAGGTCTAC^CGGAGGAAGATGCGGC^ 

TC CTATGTTAGAAAGGTT ATAC C AAAAG ATT ACAAG ACAATGGCCG CTTTAGC CAAAG C CATTGAAAAGA 

ATGTGCTGTTTTCACATCTTGATGATAATGAGAGAAGTGATATTTTTGATGCCATGTTTTCGG 

TATCGCAGGAGAGACTGTGATTCAGCAAGGTGATGAAGGGGATAACTTCTATGTGATTGATCAAGGAGAG 

ACGGATGTCTATGTTAACAATGAATGGGCAACCAGTGTTGGGGAAGGAGGGAGCTTTGGAGAACTTGCTT 

TG ATTT ATGGAACACCGAGAGCAG C CACTGTCAAAGC AAAGAGAAATGTGAAATTGTGG GGCAT CG AC CG 

AGACAGCTATAGAAGAATCCTGATGGGAAGCACACTGAGAAAGCGGAAGATGTATGAGGAATTCCTTAGT 

AAAGTCTCTATTTTAGAGTCTCTGGACAAGTGGGAACGTCTTACGGTAGCTGATGGATTGGAACCAGTGC 

AGTTTGAAGATGGGCAGAAGATTGTGGTGCAGGGAGAACCAGGGGATGAGTTCTTCATTATTTTAGAGGG 

GTC AGCTG CTGTGCTAG2^CGTCGGTC AGAAAATGAAGAGTTTGTTGAAGTGGGAAGATTGGGGC CTTCT 

GATTATTTTGGTGAAATTGCACTACTGATGAATCGTCCTCGTGCTGCCACAGTTGTTGCTCGTGGCCCCT 

TGAAGTGCGTTAAGCTGGACCGACCTAGATTTGAACGTGTTCTTGGCGCATGCTCAGACATCCTCAAACG 

AAAGATCCAGGAGTACAACAGTTTTGTGTCACTGTCTGTCTGAAATCTGCCTCCTGTGCCTCCCTTTTCT 

CCTCTCCCCAATCCATGCTTCACTCATGCAAAGTGCTTTATTTTC 

TGGCATCGCAGCTTCCTGTCTGTTTATATATTGAAAGTTGCTTTTATTGCACCATTTTCAATTTGGA 

TTAACTAAATGCTCATACACAGTTAAATAAATAGAAAGAGTTCTATGGAGACTTTGCTGTTACTGCTTCT 

CTTTGTGCAGTGTTAGTATTCACCCTGGGCAGTGAGTGCCATGCTTTTTGGTGAGGGCAGATCCAGCACC 

TATAAGCGTATTTAGACTGTGGCCATATATGCTGTATTTCTTTGTAGAATAAATGGTTTCTCATTAAACT 
CTAAAGATTAGGGA^TGGATATAGAAAATCTTAGTATAGTAGAAAGACATGTGCCTGTAATTA2^ACTAG 
TTTAAGGGTGGAAAAATGAAAATTTTTGCTAATTATCAATGGGATATGATTGGTTCAGTTTTTTTTTTCC 
AGAGTTGTTGTTTGCCAAGCTAATCTGCCTGGTTTATTTATATCTTGTTATTAATGTTTCTTdTCCAATT 
•CTGAAATACTTTTGAGTATGGCTATCTATACCTGCCTTTTAAGTTTGAAACTAACTCATAGATGCAAATA 
TTGGTTAGTATTTAACTACATCTGCCTCGGCTCACAAATTCCGATTAGACCTTTATCCAGCTAGTGCCAA 
ATAATTGATCAGATGCTGAATTGAGAATAAGAATTTGAGGTCTACATTCTTGGTTGTTAATTTAGAGCGT 
TTGGTTAAAGTATGTCCTTCAGCTGACTCCAGTATAATCTCCTCTGCTCATTAAA.CTGATTCCAGGAGAT 
TGGATTTGCTGTGACTAGATACAGATGGAGCAAATGTCCTAACAGAGAAATAGAGGTGATGCTGCTAAAG 
GGAGAAATGCCAGGCGGACAAAGTTCAGTGTCGGGAATTTTCCCCGTGACATTCACTGGGGCATGAGATT 
TTGGAAGAAGTTTTTTACTTTGGTTTAGTCTTTTTTTCCTCCTTTTTATTCAGCTAGAATTTCTGGTGGG 
TTGATGGTAGGGTATAATGTGTCTGTGTTGCTTCAAATTGGTCTGAAAGGCTATCCTGCTGAAAGTCCTG 
CTTTCCTATCTAGCATTTATTCCTCTGGCAAACTTTTCTTTCTTTTCT^ 

TGAGTCTTAACTGTATTTCAGTATTTTCCAGCCTTATGTGTTACATTATTCCAATGATACCCIAACAGTTT 
ATTTTTATTATTTTTTTAAACAAAATTTCACAGTTCTGTAATGTAGGCACTT 

ATATATAAGGTAATGTAGGGTTATATTTGGGAGTGACTGCAAGCATTTTTCCATCTGTGTGCAACTAACT 

GACTCTGTTATTGATCCCTTCTCCTGCCCTTTCCCAGGTAATTTA7UVTTGGTCATGGTAGATTTTTTTCA 

TAGATTTGAAA2UVCTTTTAGGTTGTTACCAAGTATGAAGTATAAATCTGGGGAAGA 

TTTAGGGTGGGTAAGAAAGCCACCTTGTTACAAATTTTTTAATTTCCA^^ 

GTTTCTGATCTGTACTTTGTGTTTAGCTACCTTTTTATATTTAAAAAATTAAA 

TACAAGCTTAAAGCTTGATTTGATCT 
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Human PTPN12 mRNA sequence - varl (pubHc gi: 292408) (SEQ ID NO: 123) 

AGCGACCGCAGCCGGGGGGACGCGGGAGGATGGAGCAAGTC^ 

CCAGGCCATGAAGAGTCCTGACCACAATGGGGAGGACAACTTCGCC 

TTGTCTACGAAATATAGAACAGAAAAGATATATCC 

AGAACAGATACAAJGGACATACTGCCATTTGATCACAGCCGAGTTA^ 

AGATTCAGACTATATCAATGCAAATTTTATAAAGGGC^ 

GGACCTTTAGCAAATACAGTAATAGATTTTTGGAGGATGATATC 

TGGCCTGCCGAGAATTTGAGATGGGAAGGAAAAAATGTGAGCGCTATTGGCC 

C^TAACGTTTGCACCATTTAAAATTTCTTGTO^ 

CTCTTACTTGAATTTCAAAATGAATCTCGTAGGCTGTATCAGTTTC^ 

ATGTTCCTTCATCATTTGATTCTATTCTGGACATG^ 

TGTTCCTATTTGTATTCATTGCAGTGCAGGCTG^ 

TGGAATTTACTAAAAGCTGGGAAAATAC(^GAGGAATTTAATGTATTTAATTTAA 
"CACAAAGGCATTCTGCAGTACAAACAAAGG7VGCAATATGAAOT 
TGAAAAACAGCTACAACTATATGAAATTCATGGAGCTCAGAAAATTGCTO 
ACTGAAAACATGATGAGCTCCATAGAGCCTGAAAAAGA^ 

CATCTTGACACCTTCTCCCCCTTCAGCTT^ 
CATCCAAAGCCAGTGTTGCATATGGTTTCATCAGAA 
AATCAACAGAACTTCCAGGGAAAAATGAATCAAG^ 
AAGTTTTGAGATTAAGAAGGTCCCTCTCCAAGAGGGACCA 

AGGGGACATGCT^TTAAAATTAAATCTGCTTCACCTTGTATAGCTGATAAAATOT 
TAAGTTCAGATCTAAATGTCGGTGATACTXCCCAGAAT^ 
- CAAAGTTTGkGTTACTCCACC^GAAGAATCCCAGAATT 

CTTGATGAGAAAGGACATGTAATOTGGTCATTTCATGGACCTGAAAATGCCATACCCATACCTGATTTAT 
CTGAAGGCAATTCCTCAGATATCAACTATCAAAC^ 

ACAAGTTGAAACACCTGATCTTGTGGATCATGATAACACTTCACCACTCTTC^ 

ACTAATCCACTTCACTCTGATGACTCAGACTCAGATGAAAGAAACTCTGATGGTGCTGTGACCCAGAATA 
AAACTAATATTTCAAGAGGAAGTGCCACAGTTTCT 

AGTATTGCCAATGTCCATTGCTAGACATAATATAGCAGGAACAACAC^TTCAGGTGCTGAAAAAGATGTT 

GATGTTAGTGAAGATTCACCTCCTCCCCTACCTGAAAGAACTCCTGAATCGTTTGTGTTAGCAAGTGAAC 

ATAATACACCTGTAAGATCGGAATGGAGTGAACTTCATUVGTCAGGAACGATCTGAACi^ 

AGGCTTGATAACOTCTGAAAATGAGAAATGTGATCATCCAGCGGGAGGTATTCACTATGAAA 

GAATGTCCACCTACTTTCAGTGACAAGAGAGAACAAATATCAGAAAATCC^CAGAAGCCAC^GATATTG 

GTTTTGGTAATCGATGTGGAAAACCCAAAGGACCAAGAGATCCACCTTCAGAATGGACATGAT 

GCTAGAAGACACTTTAAGTTATACTGGAAAATTCAGGTGCCACTGAAAGCCAGATTTATAGTATTCCATC 

TTTAATATGTGGGACTAACAGCAGTGTAGATTGTTACCTTAATATTTTTTGCTGGGACCATCTACCTGCC 

TTATACTACACTTAGGAAAAAGTATTACATATGGTTTATTTTGAAACTTCAAGTATTATTGCCTTAATGT 

CTCTTAACCCTGTTACACGCTGCTTGTAGACATGTTAATATAGTAATACCTTTATGATATATTGAGTTTA 

AGGACTAOTCTTTTTCTGTTTTATCATGTATGCATTATTTTGTATATGTACAGGGCAAGTAGGTATATAA 

TTTGATAAAGTTGCAATTGAAATATTATTAACAGAAGATGTAAGAAATTTCTGCATGGTCTAAATCTTTG 

TGTACTTTATTTGTAAATTATTTGCCCTGGAGTTTTAGAAAATAGTTTCTGAATTTTAAACTTGCTGGAT 

TCATGCTVGCCAGCTTTGCAGGTTATCAGAGATCAAAGATTGTAATAATAATTTTGTAAATTGTAAGCAA 

AAGTTATTTTTATATTATATACAGTCTAATTGTTCATCCTAATTGTTCCTGTTTTCATCTAGTCAGAGAT 

TCAGTAAGTGCCTTGGAACAATATTGAATTCTCTTAGCTTGTGTGTGTTTCTTTAATATTTGAACTCAAG 

TGGGATTAGAAGACTATCAAAATAC^TGTATO^ 

TTTTACAGTG 

Human PTPN12 mRNA sequence - var2 (pubHc gi: 29476876) (SEQ ID NO: 124) 

GGGGAGAGGCGGCTGCGGCTGCGGCTGCGGCTGCTGGCGGGGGGTGGGGGGGAGGAGGAACCGGGAAGGG 
GGGGCAGGGCGAGCGGAGAGCTAGCTGTGTTCCTGAGGCGCAGCCGCCGCCTAGGGCGGTGGGGAGGAGG 
AGGGAGCCGCGGGGCTTGGCGGGGTCGGGAGGGAGGGACGTGCTGGGGGAACGAGCTGGGGAAGACGGAG 
CGGGCTCTGTGCCGGGCGGGCGGGCGGCGGGGGGGCCAGCGACCGCAGCCGGGGGGACGCGGGAGGATGG 
AGCAAGTGGAGATCCTGAGGAAATTCATCCAGAGGGTCCAGGCGATGAA 

GGACAACTTCGCCCGGGACTTCATGCGGTTAAGAAGATTGTCTACCAAATATAGAACAGAAAAGATATAT 
CCCACAGCCACTGGAGAAAAAGAAGAAAATGTTAAAAAGAACAGATACAAGGACATACTGCCATTTGATC 
ACAGCCGAGTTAAATTGACATTAAAGACTCCTTCACAAGATTCAGACTATATCAATGCAAATTTTATAAA 
<X3GCGTCTATGGGCCAAAAGCATATGTAGCAACTCAAGGACCT 

AGGATGATATGGGAGTATAATGTTGTGATCATTGTAATGGCCTGCCGAGAATTTGAGATGGGAAGGAAAA 

AATGTGAGCGCTATTGGCCTTTGTATGGAGAAGAC^ 

GGATGAACAAGGAAGAAGAGACTACTTCATGAG 

CTGTATCAGTTTCATTATGTGAACTGGCCAGACCATGATGTTCCTTCATCATTTGATTCTATTCTGGACA 
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TGATAAGCTTAATGAGGAAATATCAA^ 

TGGAAGAACAGGTGCCATTTGTGCCATAGATTATACGTGGAAT^ 

GAATTTAATGTATTTAATTTAATACAAGLAAATGA 

AATATGAACTTGTTCATAGAGCTATTGCCCAACTGTTTGAAA 

AGCTCAGAAAATTGCTGATGGAGTGAATGAAATTAACACTGAAAA 

AAACAAGATTCTCCTCCTCCAAAACCACCAAGGACCCGCAGTTGCCTT^ 

AAATACTGCAGCCACCGGAACCTCATCCAGTGCCACCGATCTT^ 

AACAGTCACTACTGTGTGGCAGGA.CAATGATAGATACCAT CCAAAGC CAGTGTTGCATATGGTTTCATCA 
GAACAACATTCAGCAGACCTCAACAGAAACTATAGTAAATCAACAGAACT 
CAATTGAACAGATAGATAAAAAATTGGAACGAAATTTAAGTTTTGAGATTAAGAAGGTCC 
GGGACCAAAAAGTTTTGATGGGAAGACACTTTTGAATA^^ 

CCTTGTATAGCTGATAAAATCTCTAAGCCACAGGAATTAAGTTCAGATCTAAATC 

AGAATTCTTGTGTGGACTGCAGTGTAACACAATCAAACAAAGTTTCAGTTACT 

GAATTCAGACACACCTCCAAGGCCAGACCGCTTGCOTOTTGATGAGAAAGGAC^ 

CATGGACCTGAAAATGCC^TACCCATACCTGATTTATCT^ 

CTAGGAAAACTGTGAGTTTAACACCAAGTCCTACAACACAAGTTGAAA 

TAACACTTCACCACTCTTGAGAACACCCCTCAGTTTTACT^ 

GATGAAAGAAACTCTGATGGTGCTGTGACCCAGAATAAAACTAAT 

CTCCTGCGACTAGTACTGAAAGCATTTCTACT^ 

AGCAGGAACAACACATTCAGGTGCTGAAAAAGATGTTGATGTT^^ 

GAAAGAACTCCTGAATCGTTTGTGTTAGCAAGTGAACATAATACACCTGTAAGATCGGAATGGAGTGAAC 
TTCAAAGTCAGGAACGATCTGAACAAAAAAAGTCTGAAGGCTTGATAACOT 
TCATCGAGCGGGAGGTATTCACTATGAAATGTGCATAGAATGTC CAC GTACTTTCT 
CAAATATCAGAAAATCCAACAGAAGCCAGAGATATTGG 

CAAGAGATCCACCTTCAGAATGGACATGATTCAGGGAGCTAGAAGACACTTTAAGTTATACTGGAAAATT 

CAGGTG C CACTGAAAGCCAGATTT ATAGTATTCCATCTTTAATATGTGGGACTAACAGCAGTGTAGATT 

TTACCTTAATATTTTTTGCTGGGACCATCTACCTGCCTTATACTACACTTAGGAAAAAGTATTACATATC 

GTTTATTTTGAAACTTCAAGTATTATTGCCTTAATGTCTCTTAACCCTGTTACACGCTGCTTGTAGACAT 

•GTTAATATAGTAATACGTTTATGATATATTGAGTTTAAGGACTACTCTTTTTCTGTTTTATCATGTATGC 

ATTATTTTGTATATGTACAGGGCAAGTAGGTATATAATTTGATAAAGTTGCAATTGAAATATTATTAACA 

GAAGATGTAAGAAATTTCTGCATGGTCTAAATCTTTGTGTACTTTATTTGTAAATTATTTGCCCTGGAGT 

TTTAGAAAATAGTTTCTGAATTTTAAACTTGCTGGATTCATGCAGCCAGCTTTGCAGGTTATCAGAGATC 

AAAGATTGTAATAATAATTTTGTAAATTGTAAGGAAAAAGTTATTTTTATATTATATACAGTCTAATTGT 

TCATCCTAATTGTTCCTGTTTTCATCTAGTCAGAGATTCAGTAAGTGCCTTGGAACAATATTGAATTCTC 

TTAGCTTGTGTGTGTTTCTTTAATATTTGAACTCAAGTGGGATTAGAAGACTATCAAAATACATGTATGT 

TTCAGGATATTTGACCTGTCATTAAAAAAAACAAACAGTTTTACAGTGCCAAAAAAAAAAAAAA^ 

Human PTPN12 nJRNA sequence - var3 (public gi: 18375651) (SEQ ID NO: 125) 

AGCGACCGCAGCCGGGGGGACGCGGGAGGATGGAGCAAGTGGAGATCCTGAGGAAATTCATCCAGAGGGT 
CCAGGCCATGAAGAGTCCTGACCACAATGGGGAGGACAACTTCGCCCGGGACTTCATGCGGTTAAGAAGA 
TTGTCTACCAAATATAGAACAGAAAAGATATATCCCACAGCCACTGGAGAAAAAG7U\GAAAATGTTAAAA 
AGAACAGATACAAGGACATACTGCCATTTGATCACAGCCGAGTTAAATTGACATTAAAGACTCCTTCACA 
AGATTCAGACTATATCAATGCAAATTTTATAAAGGGCGTCTATGGGCCAAAAGCATATGTAGCAACTCAA 
GGACCTTTAGCAAATACAGTAATAGATTTTTGGAGGATGATATGGGAGTATAATGTTGTGATCATTGTAA 
TGGCCTGCCGAGAATTTGAGATGGGAAGGAAAAAATGTGAGCGCTATTGGCCTTTGTATGGAGAAGACCC 
CATAACGTTTGCACCATTTAAAATTTCTTGTGAGGATGAACAAGCAAGAAC^ 

CTCTTACTTGAATTTCAAAATGAATCTCGTAGGCTGTATCAGTTTC7VTTATGTGAACTGGCCAGACCA 

ATGTTCCTTCATCATTTGATTCTATTCTGGACATGATAAGCTTAATGAGGAAATATCAAGAACATGAAGA 

TGTTCCTATTTGTATTC^TTGCAGTGCAGGCTGTGGAAGAACAGGTGCCATTTGTGCCATAGATTATACG 

TGGAATTTACTAAAAGCTGGGAAAATACCAGAGGAATTTAATGTATTTAATTTAATACAAGAAATGAGAA 

CACAAAGGCATTCTGCAGTACAAACAAAGGAGCAATATGAACTTGTTCATAGAGCTATTGCCCAACTGTT 

TGAAAAACAG CTACAACTATATGAAATTCATGGAGCTCAGAAAATTG CTGATGGAGTGAATGAAATTAAC 

ACTGAAAACATGATCAGCTCCATAGAGCCTGAAAAACAAGATTCTCCTCCTCCAAAACCACCAAGGACCC 

GGAGTTGCCTTGO?TGAAGGGGATGCTAAAGAAGAAATACTGCAGCCACCGGAACCTGATCCAGTGCCACC 

CATCTTGACACCTTCTCCCCCTTCAGCTTTTCCAACAGTCACTACTGTGTGGCAGGACAATGATAGATAC 

CATCCAAAGCCAGTGTTGCATATGGTTTCATCAGAACAACAT^ 

AATCAACAGAACTTCCAGGGAAAAATGAATCAACAATTGAACAGATAGAT^ 

AAGTTTTGAGATTAAGAAGGTCCCTCTCCAAGAGGGACCAAAAAGTTTTGATGGGAACACACTTTTGAAT 

AGGGGACATGCAATTAAAATTAAATCTGCTTCACCTTGTATAGCTGATAAAATCrrCTAAGCCACAGG 

TAAGTTCAGATCTAAATGTCGGTGATACTTCCCAGAATTCTTGTGTGGACTGCAGTGTAACACAATCAAA 

C^UUVGTTTCAGTTACTCCACCAGAAGAATCCCAGAATTCAGACACACCTCCAAGGCCAGACCGCTTGCCT 

CTTGATGAGAAAGGACATGTAACGTGGTCATTTCATGGACCTGAAAATGCCATACCC^TACCTGATTTAT 

CTGAAGGCAATTCCTCAGATATCAACTATCAAACTAGGAAAACTGTGAGT^ 
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ACAA.GTTGAAACACCTGATCTTGTGGATCATGATAACA.CTT 

ACTAATCCA.CTTCACTCTGATGACTCA.GACT 

AAACTAATATTTCAACAGCAAGTGCCACAGTTTCT 

AGTATTGCCAATGTCCATTGCTAGACATAATATAGCAGGAACAACACAT^ 

GATGTTAGTGAAGATTCACCTCCTCCCCTACCTGAAAGAACT 

ATAATACACCTGTAAGATCGGAATGGAGTGAACTTCAAAGTCAGGAA 

AGGCTTGATAACCTCTGAAAATGAGAAATGTGATCATCCA^ 

GAATGTCCACCTACTTTCAGTGACAAGAGAGAACAAATATCAGAAAATC 

GTTTTGGTAATCGATGTGGAAAACCCAAAGGACCAAGAGATCC^ 

GCTAGAAGACACTTTAAGTTATACTGGAAAATTCAGGTGCCAC^^ 

TTTAATATGTGGGACTAAGAGCAGTGTAGATTGTTACCTTAATATTTTTTG 

TTATACTACACTTAGGAAAAAGTATTACATATGGT^ 

CTCITAACCCTGTTACACGCTGCTTGTAGACATO 

AGGACTACTCTTTTTCTGTTTTATCATGT^^ 

TTTGATAAAGTTGCAATTGAAATATTATTAACAGAAGATGTAAGAA^ 
TGTACTTTATTTGTAAATTATTIX3CCCTGGAGTT 

TCATGCAGCC^GCTTTGCAGGTTATCAGAGATCAAAGATTGTAATAATAATTTTGTAAATTGTAAGCAAA 

AAGTTATTTTTATATTATATACAGTCTAATTGrrCATCCTAATTGTTCCTGTTTTCATCTAGTCAGAGAT 

TC^GTAAGTGCCTTGGAACAATATTGAATTCTCT 

TGGGATTAGAAGACTATCAAAATACATGTATGTTTC^ 

GTTTTACAGTG 

Human PTPN12 mRNA sequence - var4 (public gi: 545651) (SEQ ID NO: 126) 

GTTAAAAGGAACAGATAC^^GGACATACTGCCATTTGATCACAGCCGAGTTAAATTGACATTAAAGACTC 
CTTCACAAGATTCAGACTATATCAATGCAAATTTTATAAAGGGCGTCTATGGGC 

AACTCAAGGACCTTTAGCAAATACAGTAATAGATTTTTGGAGGATGGTATGGGAGTATAATGTTGTGATC 
ATTGTAATGGCCTGCCGAGAATTTGA 

Human PTPN12 mRNA sequence - var5 (public gi: 19683 965) (SEQ ID NO: 127) 

GGGACTTCACC^CTCTTCAGAACACCCCTCAGTTTTAGTAATCCACTTCACTCTGATGA 

ATGAAAGAAACTCTGATGGTGCTGTGACCCAGAATAAAACTAATATTTCAACAGCAAGTGCCACAGTTTC 

TGCTGCCACTAGTACTGAAAGCATTTCTACTAGGAAAGTATTGCCAATGTCCATTGCTAGACATAATATA 

GCAGGAACAACACATTCAGGTGCTGAAAAAGATGTTGATGTTAGTGAAGATTCACCTCCTCCCCT^ 

AAAGAACTCCTGAATCGTTTGTGTTAGCAAGTGAACATAATACACCTGTAAGATCGGAATGGAGTGAACT 

TCAAAGTCAGGAACGATCTGAACAAAAAAAGTCTGAAGGCTTGATAACCTCTGAAAATGAGAAATGTGAT 

CATCCAGCGGGAGGTATTCACTATGAAATGTGCATAGAATGTCCACCTACTTTCAGTGAGAAGAGAGAAC 

AAATATCAGAAAATC CAACAGAAGCCACAGATATTGGTTTTGGTAATCGATGTGGAAAAC CCAAAGGACC 

AAGAGATCCACCTTCAGAATGGACATGATT C AGGGAG CT AGAAGACACTTTAAGTTATACTGGAAAATTC 

AGGTGCCACTGAAAGCCAGATTTATAGTATTCCATCTTTAATATGTGGGACTAACAGCAGTGTAGATTGT 

TACCTTAATATTTTTTGCTGGGACC^TCTACCTGCCTTATACTACACTTAGGAAAAAGTATTACATATGG 

TTTATTTTGAAACTTCAAGTATTATTGCCTTAATGTCTCTTAACCCTGTTACACGCTGCTTGTAGACATG 

TTAATATAGTAATACCTTTATGATATATTGAGTTTAAGGACTACTCTTTTTCTGTTTTATCATGTATGCA 

TTATTTOGTATATGTACAGGGCAAGTAGGTATATAA 

AAGATGTAAGAAATTTCTGCATGGTCTAAATCTTTGTGTACTTTATTTGTAAATTATTTGCCCTGGAGTT 
TTAGAAAATAGTTTCTGAATTTTAAACTTGCTGGATO 

AAGATTGTAATAATAATTTTGTAAATTGTAAGCAAAAAGTTATTTTTATATTATATACAGTCTAATTGTT 

CATCCTAATTGOTCCTGTTTTCATCTAGTCAGAGATTCAGTAAGTGCCTTGGAACAATA 

TAGCTTGTGTGTGTTTCTTTAATATTTGAACTCAAGTGGGATTAGAAGACTATCAAAATACATGTATGTT 

TI^GGATATTTGACCTGTCATTAAAAAAAACAAACAGTTTTACAATAAAAAAAAAA^^ 

AAAAAA 

Human PTPN12 mRNA sequence - var6 (public gi: 220033) (SEQ ID NO: 128) 

GCCGGGGGGACGCGGGAGGATGGAGCAAGTGGAGATCCTGAGGAAATTCATCCAGAGGGTCCAGGCCATG 
AAGAGTCCTGACCACAATGGGGAGGACAACTTCGCCCGGGACTTCATGCGGTTAAGAAGATTGTCTACCA 
AATATAGAACAGAAAAGATATATCCCACAGCCACTGGAGAAAAAGAAGAAAATGTTAAAAAGAACAGATA 
CAAGGACATACTG CCATTTGATCACAG CCGAGTTAAATTGACATTAAAGACTCCTT CACAAGATT CAGAC 
TATATCAATGCAAATTTTATAAAGGGCGTCTATGGGCCAAAAGCATATGTAGCAACTCAAGGACCTTTAG 
CAAATACAGTAATAGATTTTTGGAGGATGGTATGGGAGTATAATGTTGTGATCATTGTAATGGCCTGCCG 
AGAATTTGAGATGGGAAGGAAT^AAATGTGAGCGCTATTGGCCTTTGTATGGAGAAGACCCCATAACGTTT 
GC^CCATTTAAAATTTCTTGTGAGGATGAACAAGCAAGAACAGACTACTTCATCAGGACACTCTTACTTG 
AATTTCAAAATGAATCTCGTAGGCTGTATCAGTTTCATTATGTGAACTGGCCAGACCATGATGTTCCTTC 
ATCATTTGATTCTATTCTGGACATGATAAGCTTAATGAGGAAATATCAAGAACATGAAGATGTTCCTATT 
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TGTATTCATTGCAGTGCAGGCTCTG^ 

TAAAAGCTGGGAAAATACC^GAGGAATTTAATGTATTTAATTT^ 
TTCTGCAGTACAAACAAAGGAGCAATATGAACTTGTTCATAGAGCTATTGCCCAACTC 
CTACAACTATATGAAATTCATGGAGCTCAGAAAATTGCr^ 
TGGTCAGCTCCATAGAGCCTGAAAAACAAGATTCT 

TGTTGAAGGGGATGCTAAAGAAGAAATACTGCAGCCACCGGAACCTCATCCAGTGCCACC^ 

CCTTCTCCCCCTTC^GCTTTTCO^ACAGTCACTA 

CAGTGTTGCATATGGTTTCATCAGAACAACATT 

ACTTCCAGGGAAAAATGAATCAACAATTGAACAGA™ 

ATTAAGAAGGTCCOTCTCCAAGAGGGACCAAAAAGTTTTGATO 

CAATTAAAATTAAATCTGCTTCACCTTGTATAGCTGATAA^TCTCTAAGCCA 

TCTAAATGTCXK3TGATACTTCCCAGAATTCTTGTGTGGACT 

GTTACTC(^CCAGAAGAATCCCAGAATTCAGACACACCTCCAAGGCCAGACCGCT 

AAGGACATGTAACGTGGTCATTTCATGGACCTGAAAATGCCATACCCATACCTGATTTATCT 

TTCCTCAGATATCAACTATCAAACTAGGAAAACTGTGAGTTTAACACCAAGTCCT 

AGACCTGATCTTGTGGATCATGATAACACTTCACCACT 

TTCACTOTGATGACTCAGACTCAGATGAAAGAAACTCTGATGGTGCTGTGA 

TTCAACAGCAAGTGCCACAGTTTCTGCTGC 

ATGTCCATTGCTAGACATAATATAGCAGGAACAACACATTCAGGTGCT^ 

AAGATTCACCTCCTCCCCTACCTGAAAGAACTCCTGAATCGTTTGTGTTAGCAAGTGAACATAATA(^CC 
TGTAAGATCGGAATGGAGTGAACTTCAAAGTCAGGAACGATCTGAACAAAAAAAGTCTG 
ACCTCTGAAAATGAGAAATGTGATCATCCAGCXSGGAGGTATTCACrATGAAATGTGCA 
CTACTTTGAGTGACAAGAGAGAACAAATATCAGAAAATCCAA<^ 

TCGATGTGGAAAACCCAAAGGACCAAGAGATCCACCTTCAGAATGGACATGATTCAGGGAGCT 

ACTTTAAGTTATACTGGAAAATTCAGGTGCCACTGAAAGCCAGATTTATAGTATTCCATCT 

GGGACTAACAGCAGTGTAGATTGTTACCTTAATATTTTTTGCTGGGACCATCTACCTGCCTTATACTACA 

CTTAGGAAAAAGTATTACATATGGTTTATTTTGAAAOTTCAAGTATTATTGCCTTAATGTCTC^ 

TGTTACACGCTGCTTGTAGACATGTTAATATAGTAATACCTTTATGATATATTGAGTTTAAGGACTACCC 

TTTTTCTGTTTTATCTITGTATTCATTATTTTGTATATGTACAGGGC^^ 

TTGCAATTGAAATATTATTAACAGAAGATGTAAGAAATTTCTGCATGGTCTAAATCTTTGTGTACTTTAT 
TTGTAAATTATTTGCCCTGGAGTTTTAGAAAATAGTTTCTGAATTTTAAACTTGCTGGATTCATGCA.GCC 
AGCTTTGCAGGTTATCAGAGATCAAAGATTGTAATAATAATTTTGTAAATTGTAAGCAAACATTCTGC 

Human PTPN12 protein sequence - varl (public gi: 220034) (SEQ ID NO: 266) 

•MEQVEI LRKFI QRVQAMKS PDHNGEDNFARDFMRLRRLSTKYRTEKI YPTATGEKEENVKKNRYKD I LPF 
DkSRVKLTLKTPSQDSDY INANPI KGVYGPKAYVATQGPLANTVI DFWRIWWE YNVVI I VMACRE FEMGR 
KKCERYWPLYGEDPITFAPFKISCEDEQARTDYFIRTLLI1EFQNESRRI1YQFHYVNWPDHDVPSSFDSIL 
DMI SliMRKYQEHEDVP I CI HCS AGCGRTGAI CAI DYTWNLLKAGKI PEEFNVFNLIQEMRTQRHSAVQTK 
EQYELVHRAI AQLFEKQLQLYE I HGAQKI ADGVNE INTENMVS S I E PEKQDS P PPKP PRTRS CLVEGDAK 
EE I LQPPE PHPVP P I LTPS PPS AFPTVTTVWQDNDRYHPKP VLHMVS S EQHS ADLNRNYS KSTELPGKNE 
STIEQIDKKLERNLSFEIKKVPLQEGPKSFDGNTLLNRGHAIKIKSASPCIADK^ 

SQNSCTOCSVTQSNKVSVTPPEESQNSDTPPRPDRLPLDEKGHVTWSFHGPENAIPIPDLSEGNSSDINY 
QTRKTVSLTPSPTTQVETPDLVDHDNTSPLFRTPLSFTNPLHSDDSDSDERNSDGAVTQNKTNISTASAT 
VSAATSTES I STRKVLPMS IARHNIAGTTHSGAEKDVDVSEDSPPPLPERTPESFVLASEHNTPVRSEWS 
ELQSQERSEQKKSEGLITSENEKCDHPAGGIHYEMCIECPPTFSDKREQISENPTEATDIGFGNRCGKPK 
GPRDPPSEWT 

Huaian PTPN12 protein sequence - var2 (public gi: 7689910) (SEQ ID NO: 267) 

VKRNRYKDILPFD^SRVKLTLKTPSQDSDYINANFIKGVYGPKAYVA^ 
IVMACREF 

Human PTPN1 2 protein sequence - var3 (public gi: 292409) (SEQ ID NO: 268) 

MEQVE I IjRKFIQRVQAMKS PDHNGEDNFARDFMRLRRLSTKYRTEKI YPTATGEKEENVKKNRYKDI LPF 
DHSRVKLTLKTPSQDSDYINANFI KGVYGPKAYVATQGPLANTVIDFWRMIWEYNWI I VMACREFEMGR 
KKCERTOPLYGEDPITFAPFKISCEDEQARTDYFIRTLLLEFQNESRRLYQFH^ 

DMI S LMRKYQEHEDVPI CI HCS AGCGRTGAI CAIDYTWNLLKAGKI PEEFNVFNL I QEMRTQRHSAVQTK 
EQYELVHRAIAQLFEKQLQLYEIHGAQKIADGVNEINTENMISSIEPEKQDSPPPKPPRTRSCLVEGDAK 
EEILQPPEPHPVPPILTPSPPSAFPTVTTVWQDNDRYHPKPVLH^ 

STIEQIDKKLERNLSFEIKKVPLQEGPKSFDGNTLLNRGHAIKIKSASPCIADKISKPQELSSDLNVGDT 
SQNSCVDCSVTQSNKVSVTPPEESQNSDTPPRPDRLPLDEKGHVTWSFHGPENAIPIPDLSEGNSSDINY 
QTRK^SLTPSPTTQVETPDLVDHDNTSPLFRTPLSFTNPLHSDDSDSDERNSDGAVTQNKTOISTASAT 
VSAATSTES I STRKVLPMS IARHNI AGTTHSGAEKDVDVS EDSPP PLPERTPES FVLASEHNTPVRSEWS 
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ELQSQERSEQKKSEGLITSENEKCDHPAGGIHYEMCIECPPTFSDKREQISEOTTEATDIGFGNR(^KPK 
GPRDPPSEWT 

Human PTPN12 pray sequence*- varl (SEQ ID NO: 129) 

GTTTGGNATNCTAGAGGGHATGTTTAATAC 

GAAGATACCCCACCAAACCCAAAAAAAGAGATCTTTAA 

AGTACCCATACGACX3TACCAGATTACGCT<^TA^ 

TATCAATOCAGAGTGGAATTATGGGCGGCGGCGGCGGCGTOTCOS 

GATGGAGCAAACAAAGTTTCAGTTACTCCACCAGAAGAATCCCAG^^ 

ACCGCTTGCCTCTTGATGAGAAAGGACATGTAACGTGGTCAT 

ACCTGATTTATCTGAAGGCAATTCCTCAGATATCAACTATCAAACT 

AGTCCTACAACAGAAGTTGAANGGACCTGATCTTC 

CCCCTCANTTTTACTAATCCACTTCACTC 

TGAC CCANAATAAAACTAATATTTCAACAGCAAGTG CCACAGTTTCTGCTGCCACTANTACTGAAAGCAT 
TTCTACTAGGAAAGTATTGCCNATGTCCATTGCTAGA 

AAAAAAANTTNATGTTNNTGAANATTNCCTN CTCCCNNCCnSTAANAACTC C ' 

Unigene Name ; RAIiA Unigene ID : Hs.6906 Clone ID : 3GD_1106 
Human RALA mRNA sequence - varl (public gi: 35845) (SEQ ID NO: 130) 

ATGGCTGCAAATAAGCCCAAGGGTCAGAATTCTTTGGCT^ 

GCX5TGGGCAAGTCAGCTCTGACTCTACAGTTCATGTACGATGAGTTTGTGGAGGACTATGAGCCTACC^ 
AGCAGACAGCTATCGGAAGAAGGTAGTGCTAGATGGGGAGGAAGTCCAGATCGATATCTTAGATACAGCT 
GGGCAGGAGGACTACGCTGCAATTAGAGACAACTACTTCCGAAGTGGGGAGGGGTTCCTCTGTGTTTTCT 
CTATTACAGAAATGGAATCCTTTGCAGCTACAGCTGACTTCAGGGAGCAGATTTTAAGAGTAAAAGAAGA 
TGAGAATGTTCCATTTCTACTGGTTGGTAACAAATGAGATTT^ 

GAGGCAAAAAACAGAGCTGAGCAGTGGAATGTTAACTACGTGGAAAGATCTGCTAAAAC^ 

TTGACAAGGTATTTTTTGATTTAATGAGAGAAATTCGAGCGAGAAAGATGGAAGACAGCAAAGAAAAGAA 

TGGAAAAAAGAAGAGGAAAAGTTTAGCCAAGAG^T(^GAGAAAGATGCTGC^TTTTATAA 

Human RALA mRNA sequence - var2 (public gi: 24980846) (SEQ ID NO: 131) 

CCGCTCCCCAGAGCA^VAGCGTCGGAGTCCTCCTCCTCCTTCTCCTCCTCCTCCTCCTCCTCCTCCAGCCG 
CCC^GGCTCCCCCGCCACCCGTCAGACTCCTCC^ 

GGACCGAGTACCCTCCGGCCGGAGC^CGC^GCCGCGGCTTCCGGAGCCCTCGGGGCGGCGGACTGGCTC 
GCGGTGCAGATTCTTCTTAATCCTTTGGTGA 

CAGAATTCTTTGGCTTTACACAAAGTCATCAT 

TACAGTTCATGTACGATGAGTTTGTGGAGGACTATGAGCCTACCAAAGCAGACAGCTATCGGAAGAAGGT 

AGTGCTAGATGGGGAGGAAGTCCAGATCGATATCTTAGATACAGCTGGGCAGGAGGACTACGCTGCAATT 

AGAGAC^CTACTTCCGAAGTGGGGAGGGGTTCCTCTGTGTTTTCTCTATTACAGAAATGGAATCCTTTG 

CAGCTACAGCTGACTTCAGGGAG(^GATTTTAAGAGTAAAAGAAGATGAGAATGTTCCAT^^ 

TGGTAACAAATCAGATTTAGAAGATAAAAGACAGGTTTCTGTAGAAGAGGCAAAAAACAGAGCTGAGCAG 

TGGAATGTTAACTACGTGGAAACATCTGCTAAAACACGAGCTAATGTTGACAAGGTATTTTTTGATTTAA 

TGAGAGAAATTCGAGCGAGAAAGATGGAAGACAGCAAAGAAAAGAATGGAAAAAAGAAGAGGAAAAGTTT 

AGCCAAGAGAATCAGAGAAAGATGCTGCATTTTATAATCAAAGCCGAAACTCCTTTCTTATCTTGACCAT 

ACTAATAAATATAATTTATAAG(^TTGC(^TTGAAGGCTTAATTGACTGAAATTACTTTAACATTTTGGA 

AATTGTTGTATATCACTAAAAGCATGAATTGGAACTGCAATGAAAGTCAAATTTACTTTAAAAAGAAAT 

AATATGGCTT(^CC7UIGAAGCAAAGTTCAACTTATTTCATA 

TAGCGTGTAAGCTTGTGTTTCTTGGGCAGTCTTTCTTGAAATTGAAGAGGTGA^ 

GGAGGAAAGGTGACTTCCTCTGGTGTTTATTATAAAGCTTAAATTTTATATCATTTTAAAATGTCTTO^ 
CTTCTACTGCCTTGAAAAATGACAATTGTGAACATGATAGTTAAAjC 

TG CAAAATTTAGAAGAAAAGTTAT TGGCATGGTTGTTGCATATAGTTAAACTGAGAGTAATTCAT CTGTG 
AATCTGCTTTAATTACCTGGTGAGTAACTTAGAAAAGTGGTGTAAACTTGTACATGGAATTTTTTGAATA 
TGCCTTAATTTAGAAACTGAAAAATATCTGGTTATATCATTCTGGGTGTGTTCTTACTGACACCAGGGGT 
CCGCTGCCCCATGTGTCCTGGTGAGAAAATATATGCCTGGCACAGCTTTTGTATAGAAAATTCTTGAGAA 
GTAACTGTCCGCTAGAAGTCTGTCCAAATTTAAAATGTGTGCCATATTCTGGTTCTTGAAAATAAGATTC 
CAGAGCTCTTTGATCGCTTTTAATAAACTGCAAGTTCATTTTAAATGAAGGGCCAGCATATATACTTGCA 
AGATAATTTTCAGCTGCAAGGATTCAGCA'CCAGTTATGTTTGAATGAACCCTCCTTTTCTCTGAGATTCT 
GGTCCCTGGAAATCCCTTTCTGCTAGTGGTGAGCATGTAAGTGTTAAGTTTTTAATCTGGGAGCAGGGCA 
TAGGAAGAAAATGTCAGTAGTGCTAATGCATTTTGCACTAGAACGCTTCGGGAAAATATTCATGCTTGCC 
ATCTGTTCATTTCTAAATTTATATTCATAAAGTTACAGTTTGATACAGGAATTATTAGGAGTAATTCTTT 
TCTGTTTCTGTTTATAATGAAGAACACTGTAGCTACATTTTCAGAAGTTAACATCAAGCCATCAAACCTG 
GGTATAGTGCAG7^AAACGTGGCACACACTGACCAC!ACATTAGGCTGTGTCACCATTGTGTGGTGTACCTG 
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CTGGAAGAATTCTAGCATGCTACTTGGGGACATAATTTCAG 

TTTTTCCTCTTTGCAGTGGGGCTAGGACAGT^ 

ATTTGAACAGGTCAAAGATGTGTTCAGGCATTCCAGGTA 

TTTTAGGAACTCACTCTTTAGATATTTACATCCAGC^ 

AGATGTACATCTTTCATTTCX5TATTTCT^ 

ACTGGCC^GCGGGCCC^CAATCTCCATGTGTACTTATTA 

TAACATTGTGACTTTGCTTTGAGACCTTTCCTCT 

GATGCATCACGTGTCTTAGGCTGATGCCACTACCCGATT 

C^TAAACTTCCTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human RALAmRNA sequence- var3 (publicgi: 3483427) (SEQ ID NO: 132) 

ATAATCAAAGKTCCAAACTCCTTTCTTATCTTGACCAT^ 
AAGGCTTAATTGACTGAAATTACTTTAACATT^ 
ACTCCAATGAAAGTCAAATTTACTTTAAA^ 
ATTTCATAATTGCCTACATTTATCATGGTCCTGAATC 

TCTTGAAATTGAAGAGGTGAAATGGGGGTGGGGAGTGGGAGGAAAGGTGACTTCCTC 

AAAGCTTAAATTTTATATCATTTTAAAATGTCTTGGTCTTCTACTGCCTTGAAAAATGAC^ 

ATGATAGTTAAACTACCACTTTTTTTAACCATTATTATGCAAAAAAAAAA 

Human RALA inRNA sequence - var4 (public gi: 20147712) (SEQ ED NO: 133) 

ATGGTCGACTACCTAGCAAATAAGCCCAAGGGT^ 

GCAGTGGTGGCGTGGGCAAGTCAGCTCTGACTCTACAGTTCATGTACGATGAGTTTGTGGAGGACTATGA 
GCCTACCAAAGCAGACAGCTATCGGAAGAAGGTAGTGCTAGATC 

GATACAGCTGGG CAGG AGGACTACGCTGCAATTAGAGACAACTACTT C CGAAGTGGGGAGGGGTTCCTCT 

GTGTTTT CTCTATTAC AGAAATGGAAT CCTTTG CAGCTACAGCTGACTT CAGGGAGCAGATTTTAAGAGT 

AAAAGAAGATGAGAATGTTCCATTTCTACTGGTTGGTAACAAATCAGATTTAGAAGATAAAAGA 

TCTGTAGAAGAGGCAAAAAACAGAGCTGAGCAGTGGAATGTTAACTACGTGGAAACATCTGCTAAAACAC 

GAGCTAATGTTGACAAGGTATTTTTTGATTTAATGAGAGAAATTCGAGCGAGAAAGATGGAAGACAGCAA 

AGAAAAGAATGGAAAAAAGAAGAGGAAAAGTTTAGCCAAGAGAATCAGAGAAAGATGCTGCATT^ 

Human RALA mRNA sequence - var5 (public gi: 10439805) (SEQ ID NO: 134) 

AGAATGGAAAAAAGAAGAGGAAAAGTTTAGCCAAGAGAATCAGAGAAAGA 

GCCCAAACTCCTTTCTTATCTTGACCATACTAATAAATATAATTTATAAGCATTGCCAT 

TTGACTGAAATTACTTTAACATTTTGGAAATTC 

AAAGTGAAATTTACTTTAAAAAGAAATTAATATGGCTTGACCAA 

TTGCCTACATTTATCATGGTCCTGAATGTAGCGTGTA 

TGAAGAGGTGAAATGGGGGTGGGGAGTGGGAGGAAAGGTGACTTCCTCTC^TGTTTATTATAAAGCTTAA 
ATTTTATATCATTTTAAAATGTCTTGGTCTTCTACT 

AAACTACCACTTTTTTTAACCATTATTATGCAAAATTTAGAAGAAAAGTTATTGGCATGGTTGTTGCATA 

TAGTTAT^CTGAGAGTAATTCATCTGTGAATCrGCTTTAATTACCTGGTGAGTAACTTAGAAAAGTGGTG 

TAAACTTGTACATGGAATTTTTTGAATATGCCTTAATTTAGAAACTGAAAAATATCCGGTTATATCATTC 

TGGGTGTGTTCTTACTGACACCAGGGGTCCGCTGCCCCATGTGTCCTGGTGAGAAAATATATGCCTGGCA 

CAGCTTTTGTATAGAAAATTCTTGAGAAGTAACTGTCCGCTAGAAGTCTGTCCAAATTTAAAATGTGTGC 

CATATTCTGGTTCTTGAAAATAAGATTCCAGAGCTCTTTGATCGCTTTTAATAAACTGCAAGTTCA 

AATTGAAGGGCCAGCATATATACTTGCAAGATAATTrTCAGCTGCAAGGATTCAGCACCAGTTATGTTTG 

AATGAACCCTCCTTTTCTCTGAGATTCTGGTCCCTGGAAATCCCTTTCTGCTAGTGGTGAG 

GTTAAGTTTTTAAT CTGGGAGCAGGG CATAGGAAGAAAATGTCAGT AGTGCTAATGCATTTTGCACTAGA 

ACGCTTCGGGAAAATATTCATGCTTGCCATCTGTTCATTTCTAAATTTATAT 

ATACAGGAATTATTAGGAGTAATTCTTTTCTGTTTCTGTTTATAATGAAGAACACTGTAGCTACATTTTC 
AGAAGTTAACATCAAGCCATCAAACCTGGGTATAGTGCAGAAGACGTGGCACACACTGACCACACATTAG 
GCTGTGTGACCATTGTGTGGTGTACCTGCTGGAAGAATTCTAGCATGCTACTTGGGGAGATAATTTCAGT 
GGGAAATATGCCACTGACCGATTTTTTTTTTTTTCCT 

AAGTATTTTTTTCTTTTTTCTC^GTCCTAATTTGGACAGGTCAAAGATGTGTTCAGGCATTCCAGGTAAC 

AGGTGTGTATGTAAAGTTAAAAATAGGCTTTTTAGGAACTCACTCTTTAGATATTTACATCCAGCTTCTC 

ATGTTAAATATTTGTCCTTAAAGGGTTTGAGATGTACATCTTTCATTTCGTATTTCTCATAGGCTATGCC 

ATGTGCGGAATTCAAGTTACCAATGTAACACTGGCCAGCGGGCCO 

AGTCTTATTTAACCAGGGGTCCTAACCACTAACATTGTC 

ACTGAGGTGCTATGAAGCCAACTGAGAAAGATGCATCACGTGTCTTAGGCTGATGCCACTACCCGATTTG 

TTTATTTGCAATTTGAGCCATTTAAAGACCAATAAACTT 

AAAAA 

Human RALA Protein sequence - varl (public gi: 35846) (SEQ ID NO: 269) 
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MAANKPK<&NSLiAIiHKVIW 

GQKD YAAIRDNYFRSGEGFLCVPS I TEMES FAATADFI^QI I^VKEDENVPFIiLiVGNKSDIjEDKRQVSVE 
EAKNRAEQWNVNYVETSAKTRiUtfVDKW 

Human RALA Protein sequence - var2 (public gi: 20147713) (SEQ ID NO: 270) 

MVDYIiANKPKGQNSIiALHKVIMVGSGGTO 

DTAGQEDYAAIRDNYFRSGEGFLCOTSITEMESFAATADFREQILRVKEDENVP 
SVEEAKNRAEQWNVNYyETSAKTRA^ 
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Unigene Name: SIAH1 Unigene ID : Hs. 295923 Clone ID : 3GD_150 
Human SIAH1 rnRNA sequence - varl (public gi: 27503513) (SEQ ID NO: 135) 

CCAGCGCX3TCGCCCCCTGCATCC 

AAAACAAACTTGGGAGACAGACGAAAGCTTAGGGCA<^^ 

TGGAGATGCAGTCGAATTTGAGCTCACAGGGAGGTGTGGTTGCCTCCT 

TCGACCTCTGTAACTGGTGCTTCTGAGAAGTAAATGG^ 

TCTTCTGTGCTTAGGAGCAGAAAGAGCCCAGGAGGGGCCTGTTCCI^ 

CGTGGCCTGACTTCTCATGACGGGAAAGGCTACTCCACCTTCTCTGTACT 

ACATGTTTACCAGCGGCCAGGACAAGGAAGAGAAAAGAAAT^ 

CCGGTACCT03AAGTGTCCAC<^TCCCAGAGGGTG 

CTTGGCGAGTCTTTTTGAGTGTCCAGTCTGCTTTGACTATGTGTTACCGCCC^ 

GGCCATCTTGTTTGTAGCAACTGTCGCCGAAAGCT 

CCATTCGCAACTTGGCTATGGAGAAAGTGGCTAATT 

ATGTGAAATAACTCTGCCACACACAGAAAAAGCAGACCATGAAGAGCTCTGTGAGTTTAGGC 
TGTCCGTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCTCTC 
AGCATAAGTCCATTACAACCCTACAGGGAGAGGATATAO 
TGOTGTTGACTGGGTGATGATGCASTCCTGTTTTGGCTTTC^ 

AAATACGATGGTCACCAGCAGTTCTTCG CAATCGTACAGCTGATAGGAACACGCAAG CAAGCTGAAAATT 

TTGCTTACCGACTTGAGCTAAATGGTC^TAGGCGACGATTGACTTGGGAAGCGACTCCTCGATCTATTCA 

TGAAGGAATTGCAACAGCCATTATGAATAGCGACT^ 

GCAGAAAATGGCAATTTAGGGATCAATGTAACTAT^ 

GCCAGTGTTTAAAACTTCAGTTTGACAGAAAATAAGGGACCCA 

GTAGGTGGAAGCTAGACAGATGAAGGTAAATAAAAAGAAAGGCTC 

AGTAAGACTAATATATTTAAAAATAAGTCAACAGTAA 

ATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAATAATTCTGAGTTTTGTGTT^ 

ATTGTATTGTTGAAAAAGTTTGTTTTTGCGTGGGAGTGTGTGCCTGCGTGGGTGTGTGCGTGTTTGGGTT 

TTTTTCCTTTAACTGACAAGCCATCTTGAGTGGTCATGGGCCACTGCTTTTCC 

TAGTGCTGCTGTGTGCTTTTTTTGTGTGTATTT 

TTTGACTTTTCTGTAATTCAGGTTTTTCCTTTTTTT 

TATTTGTTTATGGTAAAAAATTTATAACGTGTTC^ 

AATATTTTAAAATCy^CTATTTTGTGAAGCCATGAGTTCCAGAi^GTAAAGGTGACATCGGAAAAATAAT 

CAAAAGCTATTTAAAGCATCTATAAGGTGCTCTCTTTCTGTCTTCTACAGATGAGTCACACCTTTGAGCT 

TAATCTTTGAAAGGTTAGAGAATAAATTGATTTTTATAAATACTGCAAATCAGGCTTTTGTTTCCT 

<IAGATATCTTGGAGAAATCACATATTTTAAAATTTGTT CTTGTATT CAGAAGAAGGCAT 

CGTCATGCACAGTATTTGTAATTAAAAGCAAATCATTTGTTTAAAAAGGCA 

GGTCTTTTATAATTCTCATTAAAAGAATATCTGT 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAA 

Human SIAH1 rnRNA sequence - var2 (public gi: 4 506946) (SEQ ID NO: 136) 

GCGGCGGCCAGGGGGGAGCCGGGGCGGCCGTTGCGGGGCGCGCTCTCGAGAGGCGGCGGCGGCCAGGGTG 
TCCCGTCGGTCTCGGCGCCGGGAAGAGGCGGTGGCGCTGCCCGCGGTGGCGGGGGTTGGCGACGGAGCGC 
GTTGGTGCCAGGACCGGGGTCCGAGGCGCGCTCTCCGCCCACAGAAATGAGCCGTCAGACTGCTACAGC^ 
TTACCTACCGGTACCTCGAAGTGTCCACCATCCCAGAGGGTGCCTGCCCTGACTGGCACAACTGCATCCA 
ACAATGACTTGGCGAGTCTTTTTGAGTGTCCAGTCTGCTTTGACTATGTGTTACCGCCCATTCTTCAATG 
TCAGAGTGGCCATCTTGTTTGTAGCAACTGTCGCCCA^ 

TTGGGATCCATTCGCAACTTGGCTATGGAGAAAGTGGCTAATTCAGTACTTTTCCCCTGTAAATATGCGT 

CTTCTGGATGTGAAATAA.CTCTGCGACACACAGAAAAAGCAGACCATGAA^ 

TTATTCCTGTCCGTGCCCTGGTGCTTCCTGT^^ 

ATGCATCAGCATAAGTCCATTACAACCCTA(^GGGAGAGGATATAGTTTTTCTTGCTACA 

TTCCTGGTGCTGTTGACTGGGTGATGATGCAGTCCTGTTTTGGCTTTCACTTCATGTTAGTCriT 

ACAGGAAAAATACGATGGTCACC^GCAGTTCTTCGC^ 

GAAAATTTTGCTTACCGACTTGAGCTAAATGGTCATAGGCGACGATTGACTTGGGAA 
CTATTCATGAAGGAATTGCAACAGCCATTATGAATAGCGACTGTCTAGTCTTTGAGACCAGCA 
GCTTTTTGCAGAAAATGGCAATTTAGGCATGkATGT^ 
TTTTCTGGCCAGTGTTTAAAACTTCAGTTTCACAGAAA 

TCTTTCGGTAGGTGGAAGCTAGACACATGAAGGTAAATAAAAAGAAAGGCTGTTAAATACAGGAAACAGT 
TGCATGTAGTAACACTAATATATTTAAAAATAAGTCAACAGTAAACCACTGAAAAAATATATGTATATAC 
ACCCAAGATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAATAATTCT 

TAGATTGATTGTATTGTTGAAAAAGTTTGTTTTTGCCGTGGGAGTGTGTGCCTGCGTGGGTGTGTGCGTd 
TTGGTTTTTCCTTAACTGACAAGCCATCTTGAGTGGTC^TGGGCCACTGCTTTTCCCTTTGTGAGTCAAT 
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ACATAGTGCTGCTGTGTGCTTTTTTTGTC 

AAATTTGACTTTTCTGTAATTCAGGTTTTT^ 

GCATATTTGTTTATGGTAAAAAATTTATAACGTGTTCAATAT^^ 

AGAAATATTTTAAAATCAGCTATTTTGTGAAGC 

AATCCCAAGCTATTTAAAGCATCTATAAGGTGCTCT 

GCTTAATCTTGAAGGTTAGAGATAAATTGATTTTTATAAATACrc 

CAGATATCTTGGACAAATCACATATTTTAAAATCT 

CX5TCATGCACAGTATTTGTAATTAAAAGCAAATCATTTGTTTAA 

GGTCTTTTATAATTCTCA 

Human SIAH1 roRNA sequence - var3 (public gi: 16551141) (SEQ ID NO: 137) 

TTTATAATAGCCCTCCAAATGGGTGTGACGTATTTTGATTCTATGTCCCTAATGA 
ATTTTCTAGGATTGATTTTTAAGATGTTACCCi^ 

TGAAAATTATTAACTCTAGAAATTTTAGTTTAAACTAGACTTAGGGATATGTGTATTTTACTGG 

ACGTTTTATGCATGGGTTTTTAAAACTTCTCAAGTATTAAAACTAA 

AGAAATCCTACACTGTCCACTGGAGACATCGATC 

AACCTTACCTCAAACCATGCATCTGGGGCAGAGATCCTTACTTGCTTGGTGGOT 

TGAAGAATGTCATCTTTGTGATTGTTCCTGAAAT^^ 

TAGTGATGTCTACCACTGTGAGCTTCCAGTACTAGGTGATT^ 

TATGTGGCCAGGTAATTCACTGCTGAGGGCTTTGGATTTTCCTTTATGAACT 

TGACTATTACTAAGGGACATTTTGCTACAAAGAATGTTAGTTTTGCCAAT^ 

TTATTTTAACCAGGATTTTAGATGTAAACATCAAGTAGTTTTGGTTGTTTCAATGAAGTAACATGTT 

GCTCACATTATTTGAAGTACTTCAGTTCCTATTGCCATGAAAATTGTATCCAGCAGC 

AAAAAAAAGACTACAGTTAGTCATTATCCAATTTGATGATTTAT 

TGTTTTTTTTACAAACATTTGGTGGATACC^^ 

GGAAGGAAATATCAAAAAGGTCTGAATAGAGAACAGGCAAAT^ 

GTTGGCAGACAGATGGGTGCCTTATTTTCTGTGAAACTGAAGTTTTAAACACTGGCGA.GAAAATG 
TTGCCATTTCAAGACATGGAAATAGTTACATTGATGC^ 

TGCTGGTGTCAAAGACTAGACAGTCGCTATTCATAATGGCTGTTGCAATTCCTTCATGAATAGATCGAGG 
AGTCGCTTCCCAAGTGAATCGTCGCCTATGACCATT^^ 

TTGCGTGTTCCTATCAGCTGTACGATTGCGAAGAACTGCTGGTGACGATCGTATTTTTCCTGTTT 
AGACTAACATGAAGTGAAAGCCAAAACAGGACTGCATCAT 

GTCTGTAGC^GAAAAACTATATCCTCTCCCTGTAGGGTTGTAATGGACTTATGCTGATGCATCAGATGG 

GGCATTACAGCATCCAGAGAG CCTTGC CATTTACAGGAAGCAC CAGGGCACGGACAGGAATAAGGCCTAA 

ACTCACAGAGOTCTTCATGGTCTGCTTTTTCTGTGTGTGGCAGAGTTATTTCACATC^ 

TTTACAGGGGAAAAGTACTGAATTAGC CACTTTCT CCATAGCGAA.GTTGCGAATGGATC CCAAAGGGCC C 

CGGCAAGTTGGACAACATGTGAGCTTTGGGCGACAGTTGCTACAAACAAGATGGCGACT 

GAATGGGCGGTAACAC^TAGTCAAAGCAGACTGGACACTCAAAAAGACTCGCCAAGTCA.TTGTTG^ 

AGTTGTGCCAGTGAGGGCAGGCACCCrCTGGGATGGTGG^ 

GCAGTCTGACGGCTCATTTCTGAAATAAATACATAAGGAGGCAGGAGAAAAATAATTATAACCATGACTT 
ACTTTATAAATAATGTTTACATGCCATAAGTCCTTTTAAAGTTTCATACAAAATTTACTGAGCA/^AAGAG 
GAAGAAAAATAGGATTAAAAAAGATATT 

Human SIAH1 mRNA sequence - var4 (public gi: 21753769) (SEQ ID NO: 138) 

TTTCACCCCCAAGACAAATAGTGGCCTGCCATTTT 

TTTATC^TGACTC^GCCTGGCTAGTTACATTGTCGATTATTTCTTCCAGATGATATTTACCTG 
AATGTTTATTACTCTGCTGATGAATGTTTTCAGCAACGCTGGAGAACCCTAGGCTGCAAGGGGTTCTTCA 
CCTGTTGACTCC^TCCCCCACCCCC^GTATGGCATATATCTCTGCCGTGCTAT(^TCTTTATTCT.TTCCT 
' TTTTCATTGTCTCTTCCTGACTGTCCTC CTTTGTTCATTATGTCTGACACATATTGTGGATTGAAAGTAG 
AACAGAAAGATATACTTTCTCTACC^GACTAAAAAGTTTTGAGATGGCCCrCCATTTCTCCCATGCCTCA 
CTTCACCTTAGTTGTGTTTTTATTTATTTTT^^ 

TCTTGAGGTGGGCACTGATCAGTAGGAAATAAGATTAATACCTGGCTGGTGATAATTTGGGGGGAAGACT 
TAATTAGATAGAGATGGATAATGGGATGGCAGCAGACCTTTC 

AATACACCTCTAGAGTAGATAATTGCTTACCATTAAGAAGAGTTAATGGAAGGTGATACTCTGATTCTTT 
GGCATTGGAACTAC^TTCAATCCGCGGTATCCTC^ 

CTGAGGATGCTCAAGTCCCTGATAGAAAATGGTGTCATATTTGTATGTGCATATTCTCTTGTATAATTTA 
AGTGATCTCTGGATTACTTAATACAATGTAAACAATATGTAAATAGTTGTTATAGACTGTATTTTAAAAA 
TTTTGTATTCTTTATAAATTTTTCTGAATATTTTCAATCC^TGGCTGGTGAAGTCCTCGGATGCAGACCG 
TGTGGATACAGAGTGCCGATTTTATACAGGAGTTCACCTGTAACTCCCTGTACCTATCAAC^GCTGACTC 
CAAATTAGAAAGAAATAGAGTAAGGGAGCCT-CAGG GAGAGTC CTAGCAAAACGGATTCGATTAAACTTCA 
GTTCCTTGTATAGTTTCTTTAGTTGTTTATGGTCCATTTTCTATTTTAGCATTTATTATTCTATGTAGTC 
TATCCAAAGACGATTAAGGGAGTTCCACATGTTTTCCGGAAC7VTTTTGAAAAGAGAGCTTATCCAGTGTA 
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CAGATCCTAATAAAGTGCACATO 

CCTCTTTTGCTCAGTAAATTTTGTATGAAACTCT 

TAAGTCATGGTTATAATTATTTTTCTCCTGCCTC^ 

CTACAGCATTACOTACCGGTACCTCX^GTGTCCACCATCCCAGAGGGTGCCTGCCCT 

TGCATCCAACAATGACTTGGCGAGTCTTTTTGAGTGTCCAGTCT 

CTTCAATGTCAGAGTGGCCATCTTGTTTGT^ 

GGGGCCCTTTGGGATCCATTCGCAACTTGGCTATGGAGA 

ATATGCGTCTTCTGGATGTGAAATAACTCTGCCACACACAG 

TTTAGGCCTTATTCCTGTCCGTGCCCTGGTGCOT 

CCCATCTGATGCATGAGCATAAGTCCAT^ 

CATTAATCTTCCTGGTGCTGTTGACTGGGTGATGATGCAGTCCT 

TTAGAGAAACAGGAAAAATACGATGGTCACCAGCAGTTCTTCGCAATCGTACAGCT 

AG<^U\GCTGAAAATTTTGC^ACCGACTTGAGCTAAATC 

TCCTCGATCTATTCATGAAGGAATTGC^CAGCCATTATGAATAGCGACTGTCTAGTCTTT 

ATTGCACAGCTTTTTGCAGAAAATGGCAATTTAGGCATCAATGTAACTATTTCCATC 

ATCAAACATTTTCTGGCCAGTGTTTAAAACTTCAGTTTCACA 

CCTAAAACTCTTTCGGTAGGTGGAAGCTAGACACATGAAGGTAAATAAAAAGAAAGGCTGTT 

GAAACAGTTGCATGTAGTAACACTAATATATTTAAAAATAAGTCAACAGTAAACCACTGAAAAAATATA^ 

GTATATACACCCAAGATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAATAATT 

GTTTGTTGTAGATTGATTGTATTGTTGAA7VAAGTTTGTTTTTGCGTGGGAGTGTGTGCCT 

GTGCGTGTTTGGGTirrTTTTCCTTTAACTGACAAGCCATCTTGAGTGGTCATGG 

TTTTCATTAAATAAATTTGACTTTTCTGT 

Human SIAH1 mRNA sequence - var5 (pubUcgi: 304i824)(SEQIDNO: 139) 

ATGAGCCGTCAGACTGOTACAGCATTACCTACCGGTACCTCGAAGTGTCCACCATCCCAGAGGGTGCCTG 

CCCTGACTGGCACAACTGCATCCAACAATGACTTGGCGAGTCTTTTTGAGTGTCCAGTCTGCT 

TGTGTTACCGCCCATTCTTCAATGTCAGAGTGGCCATCTTGTOT 

TGTTGTCCAACTTGCCGGGGCCCTTTGGGATCCATTCGCAACTTGGCTATGGAGAAAGTGGCTAATTCAG 
TACTTTTCCCCTGTAAATATGCGTCTTCTGGATGTGAAATAACTCT 

TGAAGAGCTCTGTGAGTTTAGGCCTTATTCCTGTCCGTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCT 

CTGGATGCTGTAATGCCCCATCTGATGCATCAGCM 

TTTTTCTTGCTACAGAC^TTAATCTTC 

TCACTTCATGTTAGTCTTAGAGAAACAGGAAAAATACGATC^ 

CTGATAGGAACACGCAAGCAAGCTGAAAATTTTGCTTACCGACTTGAGCTAAATGGTCATAGGCGACGAT 
TGACTTGGGAAGCGACTCCTCGATCTATTCATGAAGGAATTGCAACAGCCATTATGAATAGCGAGTGTCT 
AGTCTTTGACACCAGCATTGCACAGCTTTTTGCAGAAAATGGCAATTTAGGCATCAATGTAACTATTTCC 
ATGTGTTGAAATGGCAATCAAACATTTTCTGGCCAGTGTTTAAAACTTCAGTTTCACAGAAAATAAGGCA 
CCCATCTGTCTGCCAACCTAAAACTCTTTCGGTAGGTAGAAGCTAGACACATGAAGGTAAATAAAAAGAA 
AGGCTGTTAAATACAGGAAACAGTTCCATGTAGTAACACTAATATATTTAAAAATAAGTCAACAGTAAAC 
CACTGAAAAAATATATGTATATACACCCAAGATGGGCATCTTTTGTATTAAGAAAGGAAGCATTGTAAAA 
TAATTCTGAGTTTTGTGTTTGTTGTAGATTGATTGTATTGTTGAAAAAGTTTGTTTTTGCGTGGGAGTGT 
GTGCCTGCGTGGGTGTGTGCGTGTTTGGGTTTTTTTCCTTTAACTGACAAGCCATCTTGAGTGGTCATGG 
GCCACTGCTTTTCCCTTTGTGAGTC^TAGATAGTGCTGCTGTAAGCCGTTTTTGTGTGTATTTGCTAAT 
TTTTATTAATTTTAGTTTTTCATTAAATAAATTTGACTTTTCTGTAATTCAGGTTTTTCCTTTTTTTGT^ 
CGATTTTAAAGTTAGTATCTTTTGATATGGC7VTATTTGTTTATGGTAAAAAATTTATAACGGGTTCAATA 
TTTTCTTTTCCCCCATTAATCAAGTCCATTGGAAATATTTTAAAACCAGCCTATTTTGGTGAACCCATGA 
GTTCCCAGAAAGTAAAGGTGACACCCGGAAAAATAATCCAAAAGCCTATTTAAAGCCACCTATAAGGTGC 
CCCCCTTTCCTGTCTTCCTACAGATGAGTCACACCTTTGAGCCTTAACCTTTGAAAGGTTAGAGAATAAA 
TTGATTTTTATAAATACTGCAAATCCAGGCTTTTGTTTCCTTTTTCCAGATATCCTTGGACAAATCACAT 
ATTTTAAAATTTGTTCTTGTATTTATTGGTTTTGCAGAAGAAGGCATCGTCATGCACAGTATTTGTAATT 
AAAAGCAAATTCATTTGTTTAAAAAGGCAGTTTGCAAA/y^TGTTTTTGGTCTTTTATAATTCTCA 

Human S1AH1 mRNA sequence - var6 (public gi: 17390431) (SEQ ID NO: 140) 

CGGCGCCGGGAAGAGGCGGTGGCGCTGCCCGCGGTGGCGGGGGTTGGCGACGGAGCGCGTTGGTGCCAGG 
ACCGGGGTCCGAGGCGCGCTCTCCGCCCACAGAAATGAGCCGTCAGACTGCTACAGCATTACC!TACCGGT 
ACCTCGAAGTGTCCACCATCCCAGAGGGTGCCTGCCCTGACTGGCAGAACTGCATCGAACAATGACTTGG 
CGAGTCTTTTTGAGTGTCCAGTCTGCTTTGACTATGTGTTACCGCCCATTCTTCAATGTCAGAGTGGCCA 

CGCAACTTGGCTATGGAGAAAGTGGCTAATTCAGTACTTTTCCCCTGTAAATATGCGTCTTCTGGATGTG 
AAATAACTCTGCCACACACAGAAAAAGCAGACCATGAAGAGCTCTC 

GTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCTCTGGATGCTGTAATGCCCCATCTGATGCATCAGCAT 
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iUVGTCCATTACAACCCTACAGGGAGAGGATATAGTTTTTCTTGCT 

TTGACTGGGTGATGATGCAGTCCTGTTTTGGCTTT 

CGATGGTCACCAGCAGTTCTTCGCAAT(XjTACAGCTC 

TACCGACTTGAGCTAAATGGTCATAGGCGACGATTGACTTGGGAA 

GAATTGCAACAGCCATTATGLAATAGCGACTGTCTAGTCTTTC 

AAAATGGCAATTTAGGC^TCAATGTAACTATTTCCATGTGTTGAAAT 

GTGTTTAAAACTTGAGTTTCACAGAAAATAAGGCACCC^TCTGT 

GTGGAAGCTAGACAGATGAAGGTAAATAAAAAGAAAGGCTGTTAAATACAG 

ACACTAATATATTTAAAAATAAGTCAACAGTAAACCACT^ 

GCATCTTTTGTATTAAGAAAGGAAGCATTGT^ 

TATTGTTGAAAAAGTTTGTTTTTGCGTGGGAGTGTC^ 

TCCTTTAACTGACAAGCCATCTTGAGTGG 

GCTGCTOTGTGCTTTTTTTGTGTGTATTTGCTAATTTTr 

ACTTTTCTGTAATTCAGGTTTTTCCTTTT^ 

TGTTTATGGTA7VAAAATTTATAACGTGTTCAATATTTTCTTTTCCCCCA 

TTTTAAAATCAGCTATTTTGTGAAGCCATGAGTTC^ 

AGCTATTTAAAGCATCTATAAGGTGCTCTCTTTCT 

CTTTGAAAGGTTAGAGAATAAATTGATTTTTATAAATACTGCAAATCAGGCTTTTGTTTC 
TATCTTGGACAAATCACATATTTTAAAATTTC 

ATGCAGAGTATTTGTAATTAAAAGCAAATCATTTGTTTAAAAAGGCAGCT 
TTTTATAATTCTCATTAAAAGAATATCTGG CCATTTTTAAAAAAAAAAAAA^ 
AAAA 

Human SIAH1 toRNA sequence - var7 (public gi: 23274141) (SEQ ID NO: 141) 

GTCCCGTCGGTCTCGGCGCCGGGAAGAGGCGGTGGCGCTGCCCGCGGTGGCGGGGGTTGGCGACGGAGCG 

CGTTGGTGCCAGGACCGGGGTCCGAGGCGCGCTCTCCGCCCACAGAAATGAGCCGTCAGACTGCTACAGC 

ATTACCTACCGGTACCTCK3AAGTGTCCACCATCCCAGAGGGTGCCTGCCCTGACTGGCACAACTGCATCC 

AACAATGACTTGGCGAGTCTTTTTGAGTGTCCA 

GTCAGAGTGGCCATCTTGTTTGTAGCAACTGTCGCCCA^ 

TTTGGGATCCATTCGCAACTTGGCTATGGAGAAAGTGGCTAATTCAGTACTTTTCCCOT 
TCTTCTGGATGTGAAATAACTCTGCCACACACAGAAAAAGC^ 

CTTATTCCTGTCCGTGCCCTGGTGCTTCCTGTAAATGGCAAGGCTCTGTGGATGCTGTAATGCCCCATCT 
GATGCATCAGCATAAGTCCATTACAACCCTACAGGGAGA 

CTTCOTGGTGCTGTTGACTGGGTGATGATGCAGTCCTGTTTTGGCTTTCACTTCATGTTAGTCTTAGA 

AACAGGAAAAATACGATGGTCACCAGCAGTTCTTCGCAATCGTACAGC^GAT 

TGAAAATTTTGCTTACCGACTTGAGCTAAA^ 

TCTATT(^TGAAGGAATTGCAAGAGCCATTATC 

AGCTTTTTGCAGAAAATGGCAATTTAGGCATGAATGTAACT^ 

ATTTTCTGGCCAGTGTTTAAAACTTCAGTTTCACAGAAAATAAGGCACCCATCTGTCTGCCAA 
CTCTTTCGGTAGGTGGAAGCTAGACAGATGAAGGTAAATA 

TTGCATGTAGTAACACTAATATATTTAAAAATAAGTCAACAGTAAACCACTGAAAAAATATATGTATATA 
CACCCAAGATGGGGATCTTTTGTATTAAGAAAGGAAGGAT^ 

GTAGATTGATTGTATTGTTGAAAAAGTTTGTTTTTGCGTGGGAGTGTGTGCCTGCGTGGGTGTGTGCGTG 
TTTGGGTTTTTTTCCTTTAACTGACAAGCCATCT^ 

TCAATACATAGTjaCTGCTGTGTGCTTTTTTTGTGTGTATTTGCTAATTTTTATTAATTTTAGTTTTTCAT 
TAAATAAATTTGACTTTTCTGTAAAAA7VAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SI AH 1 Protein sequence - varl (public gi: 27503514) (SEQ ID NO: 271) 

OTGKATPPSLYSWRGVLFTCLPAARTRKRKEMSRQTATALPTGT^ 

ECPVCFDYVLPPIIiQCQSGHLVCSNCRPKIiTCCPTaiGPLGSIRNLAMEKVANSVIjF 

PHTEKADHEELCEFRPYSCPCPGASCKWQGSLDAVMPm 

MMQS CFGFHFI^VLEKQEKYDGHQQFFAI VQLI GTRKQAEMFAYRLEIiNGHRRRLTWEATPRS IHEGI AT 
AIMNSDCIiVFDTSIAQLFAENGNLGINVTISMC 

Human SIAH1 Protein sequence - var2 (public gi: 4506947) (SEQ ID NO: 272) 

MSRQTATALPTGTSKCPPSQRVPALTGTT7\SNNDIxASLFECPVCFDYVIiPPIIjQCQSGHLVCSNCRPKLT 

CCPTCRGPIXSSIRNLAMEKVANSVLFPC^ 

LDAVMPHLNMQHKSITTLQGEDIVFLA^^ 

LI GTRKQAENFAYRIiELWGHRRRIjTWEATPRS I HEGI ATAI MNSDCLrVFDTS I AQLFAENGNLGINVTI S 
MC 
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Unigene N^^ > SMN1 Unigene ID : Hs. 288986 Clone ID : GD_1114 
Human SMN1 mRNA sequence - varl (public gi: 624185) (SEQ ID NO: 142) 

CGGGGCCCCACGCTGCGCATCCGCGGGTTTGCT^ 

CGGAGCAGGAGGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATTCTG^ 

TACAGCACTGATAAAAGCATATGATAAAGCTGTGGCTTC^^ 

TGTGAAACTTCGGGTAAACCAAAAACCAGACCT^ 

AGAATACTGCAGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTOT 

TTGGATTTACCCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTG 

TATGGAAATAGAGAGGAG(^AAATCTGTCCGATOTACTTTCCC(^TCTC 

AACAGAATGCTCAAGAGAATGAAAATGAAAGCCAAGTTTCA^ 

TGGAAATAAATCAGATAACATCAAGCCCAAATCTGCTCCATGG 

ATGCCAGGGCCAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTCAAT 

CACCACCACCCCACTTACTATCATGCTGGCTGCCTCC^^ 

ACCTC CCATATGTC CAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTATGTTAATTTCATGGTACATG 
AGTGGCTATCATACTGGCTATTATATGGGTTTCAGACAAAATC^W^GAAGGAAGGTGCTCACATTCCT 
TAAATTAAGGAGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAACGATCAGACAGATCT 
GGAATGTGAAGCGTTATAGAAGATAACTGGCCTCATTTCTTCi^ 

GGAAGTGGAATGGGTAACTCTTCI^GATTAAAAGTTATGTAATAACCAAATGCAATGTC 

TGGACTCTTTTGAAAAACCATCTGTAAAAGACTGGGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAG 

TTGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGGCCTGTG 

AGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAGTGTTAG 

AGTGTCTTAAAATGTTTGAAATGGTTTAAGAAAATGTATGTGAGGCGTATGTGGCAAAATGTTACAGAAT 

CTAACTGGTGGACATGGCTGTTCATTGTACTGTTTT 

AAAATATTTAATTTTTTTTTA 

Human SMN1 mRNA sequence - var2 (public gi: 15929773) (seq id no: 143) 

ggccccacgctgcgcacccgcgggtttgctatggcgatgagcagcggcggcagtggtggcggcgtcccgg 

agcaggaggattccgtgctgttccggcgcggcacaggccagagcgatga^^ 

agcactgataaaagcatatgataaagctgtggcttcatttaagcatgctctaaagaatggtgacatttgt 

gaaacttcgggtaaaccaaaaacgagacctaaaagaaaaccto 

atactgcagcttccttacaacagtggaaagttggggacaaatgttctgccatttggtcagaagacggttg 
catttacccagctaccattgcttcaattgat^ 

ggaaatagagaggagcaaaatctgtccgatctactttccccaatctgtgaagtagctaataatatagaac 
agaatgctcaagagaatgaaaatgaaagccaagtttcaacagatgaaagtgagaactccaggtctcctgg 
aaataaatcagataacatcaagccgaaatotgctccatggaactcttttctccctccaccaccccccatg 
cgagggccaagactgggaccaggaaagccaggtctaaaattcaatggcccaccaccgccaccgccaccac 
(^ccaccccacttactat^tgctggctgcctccatttccttctggacc^ccaataattcccccaccacc 
tcccatatgtccagattctcttgatgatgctgatgctttgggaagtatgttaatttcatggtacatgagt 
ggctatcatactggctattatatgggttttagacaaaatcaaaaagaaggaaggtgctcacattccttaa 
attaaggagaaatgctggcatagagcagcactaaatgacaccactaaagaaacgato^gacagatctgga 

ATGTGAAGCGTTATAGAAGATAACTGGCCTCATTTCTTCAAAATATCAAGTGTTGGGAAAGAAAAAAGGA 
AGTGGAATGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATATTTTACTGG 
ACTCTATTTTGAAAAACCATCTGTAAAAGACTGAGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAGT 
TGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGAGCTGTGA 
GAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAGTGTTAGA 
GTGTCTTAAAATGTTTCAAATGGTTTAACAAAATGTATG 

TAACTGGTGGACATGGCTGTTCATTGTACTGTTTTTTTCTATCTTCTATATGTTTT^AAAGTATATAATAA 
AAATATTTAATTTTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAATUU^kAAAA 

Human SMN1 mRNA sequence - var3 (public gi: 13259511) (SEQ ID NO: 144) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 
ACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCGGCGGAAGTCGTCACTCTTAAGAAGGGACGGGGCCCCA 
CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 
XSGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATT^ 

ATAAAAGCATATGATAAAGCTGTGGCTTC^TTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAAOT 

CGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAAG 

AGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGTC^GAAGACGGTTGCATTTAC 

CCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGGTTTACACTGGATATGGAAATA 

GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAACAGAATGC 
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TCAAGAGAATGAAAATGAAAGCC^ 

TCAGATAACATCAAGCCGAAATCTGCTCCATGGAACT^ 

GAAGACTGGGACCAGGAAAGATAATTCCCCCACC^ 

TGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATC^ 

CAAAATCAAAAAGAAGGAAGGTGCTCACATTCCTTAA^ 

AATGACACGACTAAAGAAACGATCAGACAGATCTGGAATGTGA 

TTCTTCAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAATGGGTAA 

ATGTAATAACCAAATGCAATGTGAAATATTTTACTGGA 

GTGGGGGTGGGAGGCCAGCACGGTGGTGAGG<^GTTGAGAAAATTTGAATGT 
TATTGGATAATTATTGGTAATTTTATGGCCTGTGA 

TTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGTCTTAAAATGTTTCAAATGG 

TATGTGAGGCGTATGTGGCAAAATGTTACAGAATCTAACTGGTGGAGATGGCTGTTCATTGTACTGTTCT 

TTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATATTTAATTTTTTTTTA 

Human SMN1 mRNA sequence - var4 (public gi: 13111817) (SEQ ED NO: 145) 

GGGGCCCCACGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTG 
GGAGCAGGAGGATTCCXSTGCTOTTCCGGCGC 

ACAGCACTGATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACA 
GTGAAACTTCGGGTAAACCAAAAACCACACCTAAAAGAAAAC 

GAATACTGCAGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGCCATTT 
TGCATTTACCCAGCTACCATTGCTTCAATT 

ATGGAAATAGAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGA 

ACAGAATGCTCAAGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGG 

GGAAATAAATC^GATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTT 

TGCCAGGGCCAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTCAATGGCCCACCACCGCCAC^ 
ACCACCACCCCACrTACTATCATGCTGGCTGCCTCCATTTCCTTCTGGACCACCAATAATTCCCCCACCA 
CCTCCC^TATGTCCAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTATGTTAATTTCATGGTACATGA 
GTGGCTATCATACTGGCTATTATATGGAAATGCTG 

CGAT CAGACAGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCT CATTTCTTCAAAATATCAAGTG 
TTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAA 
TGTGAAATATTTTACTGGACTCTATTTTGAAAAACCATCTGTAAAAGACTGAGGTGGGGGTGGGAGGCCA 
GCACGGTGGTGAGGCAGTTGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGG 
TAATTTTATGAGCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATT 
TAGGAATGAAGTGTTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAATGTATGTGAGGCGTATGTG 
GCAAAATGTTAGAGAATCTAACTGGTGGACATGGCT 



Human SMN1 mRNA sequence - var5 (public gi: 13259515) (SEQ ID NO: 146) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 
ACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCGGCGGAAGTCGTCACTCTTAAGAAGGGACGGGGCCCCA 
CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 
GGATTCCGTGCTGTTCCGGCGCGGC^GAGGCCAGAGCGATGAT^ 

ATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTT 

CGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGCCA 

AGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGC 

CCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGGTTTACACTGGATATG^ 
GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAACAGAATGC 
TCAAGAGAATGAAAATGAAAGCC^GTTTCAACAGAT^^ 
TCAGATAACATCAAGCCCAAATCTGCTCGATGGAACTCTTO 

CAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTCAATGGCCC^CC^CCGCCACCGCCACCACaiCCACC 

CCACTTACTATCATGCTGGCTGCCTCC^ 

TGTCGAGATTCTCTTGATGATGCTGATGCTTTGGGAAGT^ 

ATACTGGCTATTATATGGGTTTCAGACAAAATCAAAAAGAAGGAAGGTGCTCAGATTCCT 

AGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAACGATCAGACAGATOTG 

GCGTTATAGAAGATAACTGGCCTCATTTCTTCAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAA 

TGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATATTTTACTGGACT 

TGAAAAACCATCTGTAAAAGACTGGGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAGTTGAGAAAAT 

TTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGGCCTGTGAGAAGGGTGT 

TGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGTCTTAA 

AATGTTTCAAATGGTTTAAGAAAATGTATGTGAGGCGTATGTGG 

GAC^TGGCTGTTCATTGTACTGTTTTTTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATATTTA 
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Human SMN1 Protein sequence - varl (public gi: 13259512) (seq id no: 273) 

MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKA^ 
KRK3PAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASI 

LliSPICEVANKTI EQNAQENENESQVSTDESENSRSPGNKSDNIKPKBAPWNSFLPPPPPMPGPRLGPGKI 
IPPPPPICPDSIiDDiU^AIiGSMLISWYMSGYHTGYyMGFRQNQKEGRCSHSIiN 

Human SMN1 Protein sequence - var2 (public gi: 12654181) (seq id no: 274) 

MAMSSGGSGGGVPEQEDSVIiFRRGTGQSDDSDIWDDTALI 
KRKPAKKNKSQKKOTAASLQQWKVGDKCSAIWSETC^ 

LLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRLG 

GLKFNGPPPPPPPPPPHLLSC^LPPFPSGPPIIPPPPPICPDSI^DADALGSMLISWYMSGYHTGyyMEM 

LA 

Human SMN1 Protein sequence - var3 (public gi: 4507091) (seq id no: 275) 

MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVAS 

KRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKRETCW 

LLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPK^^ 

GLKFNGPPPPPPPPPPHLLSCWLPPFPSGPPIIPPPPPICPDSLDDADAL^ 

RQNQKEGRCSHSLN 

Human SMN2 mRNA sequence - varl {public gi: 735410) (SEQ ID NO: 147) 

GCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCX3GAGCAGGAGGATTCCGTGCTGTTCCGGCGCGGCA 

CAGGCCAGAGCGATGATTCTGACATTTGGGATGATACAGCACTGATAAAAGCATATGATAAAGCTGTGGC 

TTC^TTTAAGCATGCTCTAAAGAATGGTGACATTTC 

AGAAAACCTGCTAAGAAGAATAAAAGCCAAAAGAAGAATACTG^^ 

GGGACAAATGTTCTGCCATTTGGTCAGAAGACGGTTGCATTTACCCAGCTACC^ 

tAAGAGAGAAACCTGTGTTGTGGTTTACACTGGATATGGAAATAGAGAGGAGCAAAATCTGTCCGATCTA 

CTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAACAGAATGCTCAAGAGAATGAAAATGA^ 

TTTCAACAGATGAAAGTGAGAACTCCAGGTCTCCTGGAAATAAA 

TCCATGGAACTCTTTTCTCCCTCCACCACCCC^ 

CTAAAATTCAATCGCCCACGATCGCCACCGCCACCAC^^ 

CATTTCCTTCTGGACCACCAATAATTCCCCCACCACCTCCCATATGTCCAGATTCTCT 
TGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGG 

GAAAATCAAAAAGAAGGAAGGTGCTCACATTCCTTAAATTAAGGAGAAATGCTGGCATAGAGCAGC^ 
AATGACACCACTAAAGAAACGATCAGACAGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCTCAT 
TTCTTGAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACTCTTCTTGATTAAAAGTT 
ATGTAATAACCAAATGCAATGTGAAATATTTTACTGGACTCT 

AGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGGAGTTGAGAAAATTTGAATGTGGATTAGACTTTGAAT 
GATATTGGATAATTATTGGTAATTTTATGAGCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTA 
ATTTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAA 
TGTATGTGAGGCGTATGTGGCAAAATGTTACAGAATCTAACTGGTGGACATGGCTGTTCATTGTACTGTT 



Human SMN2 mRNA sequence - var2 (public gi: 13259530) (seq id no: 148) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 
ACCGTACTGTTCCGCTCCCAGAAGCCCCGGGCGGCGGAAGTCGTCACTCTTAAGAAGGGACGGGGCCCCA 
CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 
GGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATTCTGACATTTGGGATGATACAGCACTG 
ATAAAAGC^TATGATAAAGCTGTGGCTTCATT^ 

CGGGTAAACCAAAAACCAGACCTAAAAGAAAACCTGCTAAGAAGAATAAA 

AGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGTCAGAAGACGGTTGCATTTAC 

CCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCT . 

GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAACAGAATGC 

TC^AGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCTCCTGGAAATAAA 

TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCT^ 

CAAGACTGGGACCAGGAAAGATAATTCCCCGACCACCTCCCATATC 

TGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATCATACTGGCTATTATATGGAAATGCTG 
GCATAGAGCAGCACTAAATGACACCACTAAAGAAACGATCAGACAGATCTGGAATGTGAAGCGTTATAGA 
AGATAACTGGCCTCATTTCTTCAAAATATCAAGTGTTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACTC 
TTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATATTTTACTGGACTCTTTTGAAAAACCA 
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TCTGTAAAAGACTGAGGTGGGGGTGGGAGGCCAGCACXSGTG^ 
ATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGGCCT 
AAAAGACTGTCTTAATTTGCATACTTAAGCATTT^ 
ATGGTTTAACAAAATGTATGTGAGGCGTATGTGGCAAAATGTTACAGAA 

AAA 

Human SMN2 mRNA sequence - var3 (public gi: 13259528) (seq id no: 149) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGT 

ACCXSTACTGTTCCGCTCCCAGAAGCCCCGGGCGGCGGAAGTC^ 

CGCTGCGCACCCGCGGGTTTGCTATGGCGAT^ 

GGATTCCGTGCTGTTCCGGCGCGGCAGAGGCCAGAGCGATGAT^ 

ATAAAAGCATATGATAAAGCTGTGGCTTCATTTAAGCATGCTC 

CGGGTAAACCAAAAAC(^CACCTAAAAGAAAACCTGCT^ 

AGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGCCAOT 

CCAGCTACCATTGCTTCAATTGATTTTAAGAGAGAAACCTGTGTTGTGG 

GAGAGGAGCAAAATCTGTCOTATCTACTTTCCCCAATCTGTGAAGTAGCT 

TCAAGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCT 

TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTOT 

CAAGACTGGGACCAGGAAAGATAATTCCCCCACCACCT^ 

TGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATCATAerGGCTA 
CAAAATCAAAAAGAAGGAAGGTGCTCACATTCCTTAAATT^ 

AATGACAGCACTAAAGAAACGATCAGACAGATCTGGAATGTGAAGCGTTATAGAAG 

TT CTTCAAAATAT CAA.GTGTTGGGAAAGAAAAAAGGAAGTGGAATGGGTAACT CTTCTTGATTAAAAGTT 
ATGTAATAACGAAATGCAATGTGAAATATTTTACTGGACTCTTTTGAAAAACCA 

GTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAGTTGAGAAAATTTGAATGTGGATTAGATTTTGAATGA 
TATTGGATAATTATTGGTAATTTTATGGCCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAAT 
TTGCATACTTAAGCATTTAGGAATGAAGTGTTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAATG 
TATGTGAGGCGTATGTGGCAAAATGTTACT^GAATCT^ 

TTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATATTTAATTTTTTTTTAAA 

Human SMN2 mRNA sequence - var4 (public gi: 13259526) (seq id no: 150) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCT 

ACCGTACTGTTCCGCrrCCCAGAAGCCCCGGGCGGCGGAAGTCGTCACTCTTAAGAAGGGACGGGGCCCCA 

CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 

GGATTCCGTGCTGTTCCGGCGCGGCACAGGCCAGAGCGATGATTCTGACATTTGGGATGATACAGCACTG 

ATAAAAGCATATGATAAAGOTGTGGCTTCATTTAAGCATGCTCTAAAGAATG^ 

CGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAA 

AGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTTCTGCCATTTGGTCAGAAGACGGTTGCATT'l'AC 
CCAGCTACCATTGCTTCAATTGATTTT7UVGAGAGAAACCTGTGTTGTGGTTTACACTGGATATGGAAATA 
GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAACAGAATGC 
TCAAGAGAATGAAAATGAAAGCCAAGTTTCAAGA.GATGAAAGTGAGAACTCCAGGTCTCCTGGAAAT7UVA 
TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCT 

CAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTCAATGGCCCACCACCGCCACCGCCACCACCACCACC 
CCACTTACTATCATGCTGGCTGCCTCCATTTCCTTCTGGACCACCAATAATTCCCCCACCACCTCCCATA 
TGTCCAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTATGTTAATTTCATGGTACATGAGTGGCTATC 
ATACTGGCTATTATATGGAAATGCTGGCATAGAGCAGCACTAAATGACACCACTAAAGAAACGATCAGAC 
AGATCTGGAATGTGAAGCGTTATAGAAGATAACTGGCCTCATTTCTTCAAAATATCAAGTGTTGGGAAAG 
AAAAAAGGAAGTGGAATGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATA 
TTTTACTGGACTCTTTTGAAAAACCATCTGTAAA^ 

AGGCAGTTGAGAAAATTTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGG 
CCTGTGAGAAGGGTGTTGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTTAGGAATGAAG 
TGTTAGAGTGTCTTAAAATGTTTCAAATGGTTTAACAAAATGT^ 

CAGAATCTAACTGGTGGAGATGGCTGTTCATTGTACTGTTTTTTTCTATCTTCTATATGTTTAAAAGTAT 
ATAATAAAAATATTTAATTTTTTTTTAAA 

Human SMN2 mRNA sequence - var5 (public gi: 13259525) (seq id no: 151) 

CCACAAATGTGGGAGGGCGATAACCACTCGTAGAAAGCGTGAGAAGTTACTACAAGCGGTCCTCCCGGCC 

ACCGTACTGTTCCGCTCCC^GAAGCCCCGGGCGGaSGAAGTCGTCACTCTTAAGAAGGGACGGGGCCCCA 

CGCTGCGCACCCGCGGGTTTGCTATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGA 

GGATTCCGTGCTGTTCCGGCGCGGCACAGGCCT^GAGCGATGATTCTGACATTTGGGATGATACAGCACT 

ATAAAAGCATATGATAAAGCTGTGGOTTCATTTAAGCATGCTCTAAAGAATGGTGACATTTGTGAAACTT 
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CGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAAT^ 
AGCTTCCTTACAACAGTGGAAAGTTGGGGACAAATGTT 
CCAGCTACGATTGCTTGAATTGATTTT^ 
GAGAGGAGCAAAATCTGTCCGATCTACTTTCCCCAATCTC 
TCAAGAGAATGAAAATGAAAGCCAAGTTTCAACAGATGAAAGTC^ 
TCAGATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCTCCCTCCACC^ 
CAAGACTGGGACCAGGAAAGCCAGGTCTAAAATTGAATGGCCCAC 
CGACTTACTATCATGCTGGCTGCCTCCATC^ 
TGTCCAGATTCTCTTGATGATGCTGATGCTTTGGGAAGTATC 
ATACTGGCTATTATATGGGTTTTAGACAAAATCAAAAAGAAGGAAGGTGCTC^ 
AGAAATGCTGGCATAGAGCAGCACTAAATGAGACC^ 
' GCGTTATAGAAGATAACTGGCCTC7VTTTCT 

TGGGTAACTCTTCTTGATTAAAAGTTATGTAATAACCAAATGCAATGTGAAATAT 
TGAAAAACCATCTGTAAAAGACTGAGGTGGGGGTGGGAGGCCAGCACGGTGGTGAGGCAG 
TTGAATGTGGATTAGATTTTGAATGATATTGGATAATTATTGGTAATTTTATGGCCTGTGAGAAGGGTGT 
TGTAGTTTATAAAAGACTGTCTTAATTTGCATACTTAAGCATTT 

AATGTTTCAAATGGTTT AACAAAATGTATGTGAGG CGTATGTGG CAAAATGTTACAGAATCTAACTGGTG 
GACATGGCTGTTCATTGTACTGTTTTTTTCTATCTTCTATATGTTTAAAAGTATATAATAAAAATATTTA 



Human SMN2 Protein sequence - varl (public gi: 736411) (seq id no: 276) 

AMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDICETSGKPICrTPK 
RKPAKKNKSQKKOTAASLQQWKVGDKCSAIWSEDGCIYPATIASIDFKRETCW 

LSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRIjGPGKPG 

LKFNGPPPPPPPPPPHLLSCWLPPFPSGPPIIPPPPPICPDSLDDADALGS^ 

QNQKEGRCSHSLN 

Human SMN2 Protein sequence - var2 (public gi: 13259531) (seq id no: 277) 

MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDI WDDTALI KAYDKAVASFKHALKNGDI CETSGKPKTTP 
KRKPAKKNKSQKKNTAASI^QWKVGDKCSAIWSEDGCIYPATIAS 

LLSPICEVANNIEQNAQENENESQVSTDESENSI^PGNKSDNIKPKSAPWNSFLPPPPPMPGPRLGPGKI 
IPPPPPICPDSLDDADALGSMLISVryMSGYHTGYYMEMIiA 

Human SMN2 Protein sequence - var3 (public gi: 13259529) (seq id no: 278) 

MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDICETSGKPKTTP 
KRKPAKKNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATIASID^ 

LLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRLGPGKI 
IPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGFRQNQKEGRCSHSLN 

Human SMN2 Protein. sequence - var4 (public gi: 13259527) (seq id NO: 279) 

MAMS SGGSdGGVPEQEDS VLFRRGTGQSDDSDI WDDTALI KAYDKAVAS FKHALKNGDI CETSGKPKTTP 
KRKPAKXNKSQKKNTAASLQQWKVGDKCSAIWSEDGCIYPATI 

LLS P I CEVANNI EQNAQENENESQVSTDES ENSRS PGNKSDNI KPKS APWNS FLPPPPPMPGPRLGPGKP 
GLKFNGPPPPPPPPPPHLLS CWLPPFPSGPP 1 1 PPPPPI CPDSLDDADALGSMLISWYMSGYHTGYYMEM 
LA 

Human SMN2 Protein sequence - var5 (public gi: 10937869) (seq id no: 280) 

MAMSSGGSGGGVPEQEDSVLFRRGTGQSDDSDIWDDTALIKAYDKAVASFKHALKNGDICETSGKPKTTP 
KRKPAICKNKSQIOCNTAASLQQWKVGDKCSAIWSEDGCIYP^^ 

LLSPICEVANNIEQNAQENENESQVSTDESENSRSPGNKSDNIKPKSAPWNSFLPPPPPMPGPRLGPGKP 
GLKFNGPPPPPPPPPPHLLSCWLPPFPSGPPIIPPPPPICPDSLDDADALGSMLISWYMSGYHTGYYMGF 
RQNQKEGRCSHSLN 



Unigene Name; SNX1 Unigene ID: Hs. 498154 

Human SNX1 mRNA sequence - varl (public gi: 3152939) (SEQ ID NO: 152) 

ATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCGGCCTGGAGCCGG 
AGTCCGAGGGGGCX3GCCGGGGGATCZAGAACCCGAGGCTGGGGACAGCGACACCGAGGGGGAGGACATTTT 
CACCGGCGCCGCGGTGGTC^GTAAACATCAGTCTCCAAAGATAACT^ 
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GGCTCCAAAGAAAATGGGATCCA^ 
TGGAGCTATCCTTGGACAGCACACAAAATA 

TCCTCAGGAAGCX!ACAAATTCTTCGAAGCCCCAGCCAACCTATG^ 
GATCAATTTGATTTGACAGTCGGTATAACTGATCCTG^ 
CCTACAAAGTTACAACACAGACAAGCTTACCATT^^ 
TAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCX^ 

TCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATTCTTCT^ 

TTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTT 

GGACCCI^CGTCAGAGAGTTCTTGGAAAAAGAAGAGCTGC^ 

GGTGCTGGTCTCCTCAAGATGTTCAACA^GCCACAGATGCCGT 

AATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGGAGCAGOT 

TGCTGTTGTAGAAACTCTAGTCAACC^TAGGAAAGAGCTAGCGCTGAACACAGCCCAGTTTGCAAAGAGT 

CTAGCCATGCTTGGGAGCTCTGAGGACAACACGGCATTG 

AAGAAAAAATTGAGCAGCTCCACCAGGAACAGGCCAACAATGACT 

TGACTACA.TTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCGCATGAAGACATGGCA 
CAGGATGCCCAAGCCACACTGCAGAAGAAGCGGGAGGCCGAGGCT 

ATAAGCTGCA.GCAGGCCAAGGACGAGAT CCTCGAGTGGGAGT CTCGGGTGACTCAATATGAAAGGGACTT 

CGAGAGGATTTCAACAGTGGTCCGAAAAGAAGTGATACGGTT 

AACCACGTGATCAAGTACCTTGAGACACTCCTTTA 

TCCTTCCTGAGGCAAAGGCCATCTCCTAA 

Human SNX1 mRNA sequence - var2 (public gi: 3152941) (SEQ ID NO: 153) 

ATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCGGCCTGGAGCCGG 

AGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGA.GGCTGGGGACAGCGACACCGAGGGGGAGGACATTTT 

CACCGGCGCCGCGGTGGTCAGTAAACATCAGTCTCCAAAGATAACTAC^ 

GGCTCCAAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCAC^^ 

TGAATGCATATGTAGCCTACAAAGTTACAACAGA.GACAAGCTTACCATTG 

AGTAAAAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAG CTTTCCGAGAAG CACTCTCAGAATGGC 

TTCATTGTCCCTCCATCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATT 

CTTCTTCTGCAGAATTTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGATTGTAAATGA 

TCCTACCATGTTACAGGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCTGCCACGTC 

ACCGAGACATTGAGTGGTGCTGGTCTCCTCAAGATGTT 

CCATCAAGATGAATGAAT CAGACATTTGGTTTGAGGAGAAGCT CCAGGAGGTAGAGTGTGAGGAG CAGCG 
CTTACGGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAA.GAGCTAGCGCTGAAGA 
CAGTTTGCAAAGAGTCTAGCCATGCTTGGGAGCTCTGAGGACAACACGGCATTGTC^CGGG 
AGCTGGCTGAGGTGGAAGAAAAAATTGAGC^GOTCCACCAGGAACAGGCCAA 

TGCTGAGCTCCTGAGTGACTACATT03CCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCGCATGAAG 
ACATGGCAGCGCTGGCAGGATGCCCAAGCCACACTGCAGAAGAAGCGGGAGGCCGAGGCTCGGCTGCTGT 
GGGCCAACAAGCCTGATAAGCTGCAGCAGGCCAAGGACGAGATCCT^ 

ATATGAAAGGGACTTCGAGAGGATTTCAAC^GTGGTCCGAAAAGAAGTGATACGGTTTGAGAAAGAGAAA 

TCC^GGACTTCAAGAACCACGTGATCAAGTACCTTGAGACACTCCT 

AGTACTGGGAAGCCTTCCTTCCTGAGGCAAAGGCCATCTCCTAA 

Human SNX1 nJRNA sequence - var3 (public gi: 30582804) (SEQroNO: 154) 

ATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCGGCCTGGAGCCGG 
AGTCCGAGGGGGCGGCCGGGGGATGA.GAACCCGAGGCTGGGGACAGCGACA.CCGAGGGGGAGGACATTTT 
C^CCGGCGCCGCGGTGGTCAGTAAAC^TCAGTCTCC^U^ 

GGCTCCAAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGATCT^ 

TGGAGCTATCCTTGGACAGCACACAAAATAATCAGAAGAAGGTGCTAGCCAAAACACTCATTTCTCTTCC 
TCCTCAGGAAGCCACAAATTCTTCGAAGCCCCAGCCAACCTATGAGGAGCTAGAGGAAGAAGAAC^VGGAG 
GATC^^TTTGATTTGACTVGTCGGTATAACTGATCCTGAGAAGATAGGGGATGGTATGAATGCATATGTAG 
CCTACAAAGTTACAACACAGAC^GCTTACCATTGTTCAG^^ 

TAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCACTCTCAGAATGGCTTCATTGTCCCTCCA 

CCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATTCTTCTTCTGCAGAAT 

TTCTTGAAAA&CGGAGGGCCGCTTTAGAAAGGTACC 

GGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCTGCCACG 

GGTGCTGGTCTCCTCAAGATGTTCAAC^AAGCCACAGATC 

AATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGGAGCAGCGCTTACGGAAACTGCA 
TGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCGCTGAACACAGCCCAGTTTGCAAAGAGT 
CTAGCCATGCTTGGGAGCTCTGAGGACAACACGGCATTGTCACGGGCACTCTCCCAGCTGGCTGAGGTGG 
AAGAAAAAAT TGAG CAG C T C CAC C AGG AACAGG C CAACAATG ACTT CTT C CT C CTTG C TGAGCT C CTG AG 
TGACTACATTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCGCATGAAGACATGGCAGCGCTGG 
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CAGGATGCCC7UIGCCACACT 

ATAAGCTGCAGCAGGCCAAGGACGAGATCCTCGAG 
CGAGAGGATTTCAACAGTGGTCCGAAAAGAAGTGATAC^^ 
AACCACGTGATGAAGTACCI^GAGACACTCCTTT^ 
TCCTTCCTGAGGCAAAGGCCATCTCCTAG 

Human SNX1 rnKNA sequence - var4 (public gi: 4884359) (SEQ ID NO: 155) 

GGTTGCTTTGTTAAGTTCCATCTAATGATCATTCT 

TGTCTCCTCTGGTTTGAGAACTTCCTCCT 

AGGGCTTGTGCATTTTTGTAAAGAGCTTGCAOT 

CTGAGCCAGAAGCCTCTACAAGGAATAACAGGAGCACAAAGGAAGAAGGTO 

AGGAGGGAGGACTTTGTGGAGAACCTGATGCTTGAACTGAGTCTAAAAGGTGTAAAAGT^ 

CTGCCTCCCTGTCTGTCTGGCAGGGTGAGGTAGGCGCATCTAGGGAAATGTCAAGTG^ 

TAAAGTCAGTGAGGCCCATGGAGAAAAACGAGCAGGAGCCAC^^ 

AGGCGCTTTCCACATTACCATTGTCGCTTCGTGATCTGGACACACCAGAAGGCGT^ 

AAGAGCAGCCAGGCTTATCCCTACCCTCAGGAGAGCTGAAAAGGGCAGGTATGGTGGGGCCAGAGCT 

GAGAGTTTCGGAACCACTGAGATCGGTCCTTGATTTGATGAGAGGCTTGAGGGGAGAGGGAGGTAGCTAG 

GATGCCCCGGAAGCTTCTGGCCCAGACACTG 

GTTTGGGGCAGATGGGAGGGGAAGCAGTGGTGGGGGCAGTGAGTGCTGAAGGTGTTTTAAGAAGCGGCTC 
TGGGCCAGGCACAGTGGCHTATGCCTGTATTCCTAGC^ 

GCCCAAGAATTTGAGACCAGCCTGGGGAATATAGTGAGACCCTGTCCCTACAAAAATAAAAACAACTAG 

TGGGTGTGTGGTGGTGCATGCCTGTAGGCCCAGCTACGCGGGAACATCACOTGAGCCCAGGAGGTTGAGG 

TTGCAGTGAGCTAGAGTTGCGCCACTGCACTCC^ 

AAAACAGCTCTGGATGGGAAGGGAGGCCAGTTGCTTTAAGTAGGGGAGATAGAGTTAAAGGAGGCTTTGT 
TTTATTTAAAGGTGGGACAAACTTAAGC^TGTTAATAAAATTCAGAGAAGAGAAAGAGAATGACTATCAG 
AGCC^TGTTTGGAAGAAAATGGGGTCCAGAGCACAGGAAGGGGACCTGTGTTCAGAGGGTGCCTCACTGC 
TGAGGCCACAGGAAAGAATCTGTAGGTGGAGGGGAGGCCGAAGAGGGGAAGTTTCATGCTTGATAATTAA 

•AATTTTCTGAGATAGGAATGTCATATTTACCTATTTAAGCCAAGTTTTTTTAGATAAAAGGTATGGAACC 
TGCTTTCCCCTTGGCTAGTTC^G 

TGTGATCCCAGTATTCAGTGTCAGTACTCAGTGACAAAATAAATGAGAGAAACGGGAATAAGAATTGTCG 
CCTACACAAAAATACCAGCAACTGTTAACTC^ 

GAACCCTAAAAAGTCTTTGAAGC^CTCAAGTTTTAAAAAAGGGGAGGAACTCCTGGAAATCTCAGGATG 

GGGCCAAGATGTGGCTGGAGAGTGTGTGGTGATGGAGGGCGTGTCTTTTGCCGAGCACACTCAGGGCCCA 

CGGGAAGCCCATAGACTTCAAGGACATCAAGCCCCAAGGTGGTGGGATTTTCCCCACCAGTACTTGGCAG 

CCTAGGGGGAAGGGGAGGGCGGGAGAAGATAATGGGGATCCCTGGCTCCAAACATAGGAGGACACATCTG 

TGCTAGAGTGCGCACATGCOTGGATGTACACTCTGTCTTTGGAGACACTGGCTAAGATTCTCT 

GTTTGGACAGGGTCGTGCCTGATCTGAGATAAATGGACAAGAACAACTGAAGCCTGTCTTCTGGTGCATG 

TGTCACCTGCCGATAACTGCATCTTGTGATAAAGTTGGGTGATTTACAGTCTCCACCAAATGCTAAACTC 

TGGGGTCTTACGCCTTTATAACTCCATGGGCCCCAGCAAAGGTTCAGGCTCAAAACAGGTGTCA 

TAACTGTTGAATGATTGTTCCCCAGTTGCAGGCTCTGCCACCTGGCGTTCATACTGTCTGTGAAAGGACC 
CAGCTC^CCTTTCCCTCTTTATCT^ 

AATGTGCTACTTCTCACTTCTGTGTGGCCCGAGGAGGCTGGGTTAATGCTGGGCTTGGTACCTTAAGCAC 

CCTTTCTCCCTTCCCCATCTTCATTCTCAGAATTACACCTGTCTGAAGCAGGCATTTTCCAATGCCCTAG 

ATGGGAATATAAGTGTAAGGAGATGTGAAG(^TTTGCCTGTGTGTC^GAAC^TTCACTGAGGATCCTCAT 

AGGCACTTCTAGAAACCAAATCCTTGAAGATGACTAACCAGAAATGC^ 

GCAAAAACTGAAGCCAATTGAAATGTCGATCAGGAGGGGATT^^ 

GAATATTTTACAGCCATTGAAGATGATATATAGCTATATTCATTGACAAGGAAAACTCATATTTTTTAGT 
GAAAAAAGCAGGTTATAGAATTGCATGATATTCAC^ 

ATTGAATTGTTAATAACTATGGTCACCTCTAGAGATGGAAGTTTGCATTACCTTTAATTTTTAATACCAT 

TTTGTATTGCTTAAAATTTGTATGTATTATCGTTAAAATAAGAAAAATCAAATAAAGCTA^ 

GGGAAAAAAAAAAAAAAAAA 

Human SNX1 mRNA sequence - var5 (pubHc gi: 4406620) (SEQ ID NO: 156) 

ATAAAAGGTATGGAACCTGCTTTCCCCTTGGCTAGTTCAGCGTTTGGGCTCCGGAGTGCTGAAGATGAGG 

ACTGGACTTCGAGCTGGTGTGATCCCAGTATTCAGTGTCAGTACTCAGTGACAAAATAAATGAGAGAAAC 

GGGAATAAGAATTGTCGCCTAC^CAAAAATACGAGCAACTGTTAACTCTTCCCAGAAGATTTT 

AATGCTCCTGTAGCTAGGAACCCTAAAAAGTCTTTGAAGCAACTCAAGTTTTAAAAAAGGGGAGGAACTC 

CTGGAAATCTCAGGATGGGGCCAAGATGTGGCTGGAGAGTGTGTGGTGATGGAGGGCGTGTCTTTTGCCG 

AGCAC^CTCAGGGCCCACGGGAAGCCCATAGACTT 

CCACCAGTACTTGGCAGCCTAGGGGGAAGGGGAGGGCGGGAGAAGATAATGGGGATCCCTGGCTCCAAAC 
ATAGGAGGACACATCTGTGCTACAGTGCGCACATGCCTGGATGTACACTCTGTCTTTGGAGACACTGGCT 
AAGATTCTCTGCTCGATGTTTGGACAGGGTCGTGCCTGATCTGAGATAAATGGACAAGAACAAC^ 
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CTGTCTTCTGGTGGATGTGTCACCTC 

CACCAAATGCTAAACTCTGGGGTCTTACGCCTTTATAA.CT 

AACAGGTGTCAAATAGATAACTGTTGAATGATTGTTCCCCAGTTGCAGGCT 

CTGTCTGTGAAAGGACCCAGOTCACCTTTCCCTCTTTATCTCCCAGTCCTTC 

CATGGAAACTGATTGCAAATGTGCTACTTCTCACTTCTGTG 

CTTGGTACCTTAAGCACCCTTTCTCCCTTCCCCATCTTCATTCTCAGAATTA 

ATTTTCCAATGCCCTAGATGGGAATATAAGTGTAAGGAGATGTGAAGCATTTGCCTGTGTGTCAGAACAT 

TCACTGAGGATCCTCATAGGCACTTCTAGAAACCAAATCCT 

TAGCACTGTTTACAGTTGCAAAAACTGAAGCCAATTGAAA 

GGTACAGTTACACCGTTGAATATTTTACAGCCATTGAAGA 

AACTCATATTTTTTAGTGAAAAAAGCAGGTTATAGAATTC 

ATATGGGAAGGATGTTGATTGAATTGTTAATAACT^^ 

TTAATTTTTAATACCATTTTGTATTGCTTAAAA 

AAAGCTATTTTCATTATGGGAAAAAAAAAAAAAAAAAAAA/^AAAAAAAAAAAAAAAAAA 

Human SNX1 mRNA sequence - var6 (public gi: 34535422) (SEQIDNO: 157) 

TTTCCGCCGCGGGTGGAAGAAGATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCG 

CCCTTCCCCGGCCTGGAGCCGGAGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGA 

ACACCGAGGGGGAGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACATC^ 

ATCCCTTCTTCCCATCAACAATGGCTCCAAAGAAAATG 

GATCTCTTTGCT^GATGCCACAGTGGAGCTATCCTTGGACAC^^ 

CCAAAACACTCATTTCTCTTCCTCCTCAGGAAGCCACAAATT 

GCTAGAGGAAGAAGAACAGGAGGATCAATTTGATTTGACAGTCGGTATAACTGATCCTGA 

GATGGTATGAATGCATATGTAGCCTACAAAGTTACAAGACAGACAAGCTTACCATTGTTC^ 

AGTTTGCAGTAAAAAGAAGATTTAGTGACTTTCTGGGTCT^^ 

GAATGGCTTC^TTGTCCCTCCACCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTGGGAAG 
GAAGATTCTTCTTCTGCAGAATTTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGATTG 
TAAATCATCCTACCATGTTACAGGACCCTGACGTGAGAGAGTTCTTGGAAAAAGAAGAGCTGCCACGTGC 
CGTGGGTACCCAGACATTGAGTGGTGCTGGTCTCCTCAAGATGTTGAACAAAGCCACAGATGCCGTCAGC 
AAAATGACCATCAAGATGAATGAATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGG 
AGCAGCGCTTACGGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCGCTGAA 
CACAGCCCAGTTTGCAAAGAGTCTAGCCATGCTTGGGAGCTC 

CTCTCCCAGCTGGCTGAGGTGGAAGAAAAAATTGAGCAGCTCCACCAGGAACAGGCCAACA 
TCCTCCTTGCTGAGCTCCTGAGTGACTAC^TTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCG 
CATGAAGACATGGCAGCGCTGGCAGGATGCCCAAGCCACACTGCAGAAGAAGCGGGAGGCCGAGGCTCGG 
CTGCTGTGGGCGAACAAGCCTGATAAGCTGCAGCA^ 

TGACTCAATATGAAAGGGACTTCGAGAGGATTTC^y^GAGTGGTCCGAAAAGAAGTGATACGGTTO 

AGAGAAATCCAAGGACTTCAAGAACCACGTGATCAAGTACCTTGAGACACTCCTTTGCTCACAG 

GCTGGGGAGCAGTTGGGAATCAGGTCTGGAATACTCCTAACCAAGAAGTTGCCCAGGTATAGTAAGTTTT 

TCTCTACCGTTCACAAGTTTTGTGCTGCTGCrTCCCTCTGGAAATGGGGTTTCTTTCTCTCCGCCTACCT 

CAGCTACCTGTTCTGAGGGTCTCAATCTGTTTCGTATTCCCACTTCTTTAGGGAAGGAGTTTTAAAAACA 

TCTCTTAAAATAAGAGGAGCAAAATCTATTAAAACCTATTCTCCTGCAAAGGAGGCAGAGACTTTCTCTC 

TCTCTTTTTTTTTTTTTTTTTGGTGTCCCTATCATTAAGGAAGAGCCTTTCTCTTTTATTOTTCCTGCT 

CCCTAAGCTGCTCAGGGCTCTCTGAGTCTTGCCCTCTGATGGCAAGTCTTATATATAACTAAACCTATTT 

TTGTCACCCATCAAAACACATCCTCAGTAGACTGTGTGAAGGTGTGAAGGTCTGATAATGACTTGATGCT 

TTATCTCCATAGACATGAAAGCCATGCCCTCTGCCTCTAGATAGGGTGATCGAAGAGOTCCTGAACCTTA 

GGAGGTTCAAAGAAGCTCTACTGTCTGTGCCCAGGAGGTAGCCTGCCAGCAAGAGCCCTCAGGAGTTGCA 

CACACA.GCCAAAGGGTGTTCACACAGATCTCTGCCGGTCTAGCCAGGGGAGGCCAGAGTCTCGTCAGTCA 

AGGATGGGCTTCCCCCTTAGCTGTGTCCAGAGCTGCTCAAGCTATACTGGTCAGAGTGGGCTTTGAAGCT 

CCTTTGTGAGCTCGAGCTGdTGACTGCCACTATGGGAGCCTTGCCACCTCCAGCCCCTCCATCCCAAAGA 

CGCTCCTGCCACTGGGGCCCCAGGTCCTGCTGTATCAGTTCTCTTTGGTGGGGGGCTAAGGTTTGGGGCG 

AGGCAACCTGAGAC^GAAAACGCAGTAAACATTCTGATTCCCTGTACACAGATGCAGCACCAGGGGAAG 

GGCGAGTGGTGCAAGTATTTCTTTTTAACAGGTGAAGTTTTTGGAAAAAGTCACTCTCCCTACCCCTCAG 

TATCCTTACCATCAACTTTGGTTTTATCCTTCCAGTCTTTATTATATGCTTGCTTTTACATAGTTGTAAT 

AATATACACATAAAGTATTTTGTATCCTGCTTTTAT 

ATATTGTTATATATCTTCAC7VAAGTTGATCTGTAAAGCTGTGTAATTTGAAGGCATCCATAGGGTGACTG 
TACCATAATTTTGATTCATCCCTTGTTGTTGGATTCTTGGTCAGGGGTTGTTTGTTTTGTTTTATTGGTA 
ACTTTAAAATTTTGAATACAATTTCAGATTTACAGAAAAG 

ATCTTTTATCCAGATTTACTGAGTGTTTACATTTTATCCCATTTGCCTTATCTATATTTCATGTTGCATT 
TTCTTAATCATTTGAGAATAATTTGCACAGATACCCCATTATGCCCAAAACAGTATGCATTTCCCTAAGA 
ACAGGACATTCTCTTCTAAGAGAAGAAGAGAATTACTTTAAGCATTATTCAGTATTTTTTTAAGTATTAT 
TATCAAAATCAGGAAGTTTAACAGTGATTTAATACTGTTATCTAACCCATGATTCATATTTAAATTTTGC 
CATTTATCCCAATAATGTCCTTTGTAGCCATTCTTTTAC^ 
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TGTGTCrCTCAATACTGCAGATTCCTCAACTT^ 
CAGGCTATTTTGTCG 

Human SNX1 mRNA sequence - var7 (pubUcgi: 38197125) (SEQ ID NO: 158) 

GTGGAAGAAGATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTO^ 

CTGGAGCCGGAGTCCX^GGGGGCGGCCGGGGGATCAGAACCC^ 

AGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACATCAGTCTCCAAAGA 

CATCAACAATGGCTCCAAAGAAAATGGGATCCATGAAGAACAAGAC 

GATGC(^CAGTGGAGCTATCCTTGGACAG(^CACAAAATAATCAGAA 

TTTCTCTTCCTCCTCAGGAAGCCACAAATTCTTCGAAGCCCCAGCCAACCTATGAGGAGCTAGAGGAA 
AGAACAGGAGGATCAATTTGATTTGACAGTCGGTATAACTGATCCTGAGAAGATAGGGGATO 
GCATATGTAGCCTACAAAGTTACAACACAGACAAGCTTACCATTGTTCAGAAGCAA 
AAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCACTCT 

TGTC CCT CCAC CCCCGGAGAAGAGCCT CATAGGGATGACAAAAGTGAAAGTTGGGAAGGAAGATTCTTCT 

TCTGCAGAATTTCTTGAAAAACGGAGGGCCX^TTTAGAAAGGT^^ 

CCATGTTACAGGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCTC 

GACATTGAGTGGTGCTGGTOTCCTCAAGATGTTCAACAAAGCCACAG 

AAGATGAATGAATCAGACATTTGGTTTGAGGAGAAGCTCCAG 

GGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAA 

TGCAAAGAGTCTAGCCATGCTTGGGAGCTCT^ 

GCTGAGGTGGAAGAAAAAATTGAGC^GCTCCACCAGGAACAGGCCAACAATGACTTCTTCCTCCTTGCTG 
AGCTCCTGAGTGACTACATTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCGCATGAAGACATG 
GCAGCGCTGGCAGGATGCCCAAGCCACACTGCAGAAGAAGCGGGAGGCCGAGGCTCGGCTGCTGTGGGCC 
AACAAGCCTGATAAGCTGCAGCAGGCCAAGGACGAGATCCTCGAGTGGGAGTCTCGGGTGACTCAATATG 
AAAGGGACTTCGAGAGGATTTCAACAGTGGTCCGAAAAGAAGTGATACGGTTTGAGAAAGAGAAATCCAA 
GGACTTCAAGAACCACGTGATCAAGTACCTTGAGACACTCCTT^ 

TGGGAAGCCTTCCTTCCTGAGGCAAAGGCCATCTCCTAATGGACCAAGGACCCCAGAGCCCACCTG 

ATOCTGCCTTTTTATACACTGTCCTCCTCCACCTTGATGGACCCCTAGTGATGCATCCTGCCTAGGCTGG 

ACTTAACCCCTTCCTCCCTGTCCCCACGACCAACTGTCCCCAGTTACT 

TCCATATATATTTTCTTACCTAAGAGAATAGTTTCCTGCTTTAAGCAAAAGACCTACAATAGGTGGTGGA 
ATTATGGGATGGGGTGGAGTATTGATATAAATATATAAATACAAATGTATATTTTTCAGGATGTGGTTTA 
GGAACTGGGAATAACGTTTTCTGTTACTCCTGATGGTGC(^TGAAAAGGTTATGT^TAAAATATTTTAA 
AATCAAAAAAAAAAAAAAAAAA 

Human SNX1 mRNA sequence - var8 (public gi: 23111033) (SEQ ID NO: 159) 

GGGTGGAAGAAGATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTGGGAGAGACTGCCTCCGCCCTTCCCCG 

GCCTGGAGCCGGAGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGG 

GGAGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACATCAGTCTCCAAAGATAACTAC7VTCCCTTCTT 

CCCATCAACAATGGCTCCAAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGAT^ 

CAGATGCC^CAGTGGAGCTATCCTTGGACAGCACAaVAAATAATCAGAAGAAGGTGCT 

CATTTCTCTTCCTCCTCAGGAAGCCACAAATTCTTCGAAGCCCCAGCCAACCTATGAGGAGCTAGAGGAA 

GAAGAACAGGAGGATCAATTTGATTTGACAGTCGGTATAACTGATCCTGAGAAGATAGGGGATGGTATGA 

ATGCATATGTAGCCTACAAAGTTACAACACAGACAAGCTTACCATTGTTGAGAAGCAAACAGTTTG 

AAAAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTT^ 

ATTGTCCCTCCGCCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTC^ 

CTTCTGCAGAATTTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGATTGTAAATCATCC 
TACCATGTTACAGGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCTGCCACGTGCCGTGGGTACC 
CAGACATTGAGTGGTGCTGGTCTCCTCAAGATGTTCAACAAAGCCACAGATGCCGTCAGCAAAATGACCA 
TCAAGATGAATGAATCAGACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTGTGAGGAGCAGCGCTT 
ACGGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCTAGCGCTGAACACAGCCCAG 
TTTGCAAAGAGTCTAGCCATGCTTGGGAGCTCTGAGGACAACACGG 

TGGCTGAGGTGGAAGAAAAAATTGAGCAGCTCCACCAGGAACAGGCCAACAATGACTTCT[TCCTCCTTGC 

TGAGCTCCTGAGTGACTACATTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGACCAGCG(^TGAAGACA 

TGGCAGCGCTGGCAGGATGCCCAAGCC^CACTGCAGAAGAAGCGGGAGGCCGAGGCTCGGCTGCTGTGGG 

CCAACAAGCCTGATAAGOTGCAGCAGGCCAAGGACGAGATCCTCGAGTGGGAGTCTCGGGTGACTCAATA 

TGAAAGGGACTTCGAGAGGATTTCAACAGTGGTCCGT^AAAGAAGTGATACGGTTTGAGAAAGAGAAATCC 

AAGGACTTCAAGAACCACGTGATCAAGTACCTTGAGACACTCCTTTACTCACAGCAGCAGCT 

ACTGGGAAGCCTTCCTTCCTGAGGCAAAGGCCATCTCCTAATGGACCAAGGACGCCAGAGCCCACCTGTG 

TGACGCTGCCTTTTTATACACTGTCCTCCTCCACCTTGATGGACCCCTAGTGATGCATCCTGCCTAGGCT 

GGACTTAACCCCTTCCTCCCTGTCCCCACGACCAACTGTCCCCAGTTACTCTAACCGTTATTTCATTTAG 

CTTCCATATATATTTTCTTACCTAAGAGAATAGTTTCCTGCTTTAAGCAAAAGACCTACAATAGGTGGTG 

GAATTATGGGATGGGGTGGAGTATTGATATAAATATATAAATACAAATGTATATTTTTCAGGATGTGGTT 
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TAGGAACTGGGAATAACGTTTTCTCT 
AAAATCAAAAAAAAAAAAAAAAAA 

Human SNX1 rnRNA sequence - var9 (public gi: 23111035) (SEQIDNO: 160) 

{XX3TGGAAGAAGATGGCGTCGGGTGG 

GCCTGGAGCCGGAGTCCGAGGGGGCX3GCCGGGGGATCAGAACCCGA 

GGAGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACATCAGTCT 

CCCATCAACAATGGCTCCAAAGAAAATGGGATCCATGAA 

CAGATGCCACAGTGGAGCTATCCTTGGACAGCACACAAAATA 

CATTTCTCTTCCTCCTCAGGAAGCCACAAATTCTTCGAAGCCCCAGCC^ 

GAAGAACAGGAGGATCAATTTGATTTGACAGTCGGTATAACTGATCCTGAGAAGATAGGGGAT 

ATGCATATGTAGCCTACAAAGTTACAACACAGACAAGCTTACCATTO 

AAAAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCACTCT 

ATTGTCCCTCCGCCCCCGGAGAAGAGCCTCATAGGGAT^ 

CTTCTGCAGAATTTCTTGAAAAACGGAGGGCCGCTTTAGAAAGGTACCTTCAGAGGAT 

TACCATGTTACAGGACCCTGACGTCAGAGAGTTOTTGGAAAAAGAAGAGCTGCCACGT 

CAGACATTGAGTGGTGCTGGTCTCCTCAAGATGTTCAACAAAGCCACAG 

TCAAGATGAATGAATCAGACATTTGGTTTGAGGAGAAGCPrC 

ACGGAAACTGCATGCTGTTGTAGAAACTCTAGTCAACC^^ 

TTTGCAAAGAGTCTAGCCATGCTTGGGAGCTCTGAGGAC^ 

TGGCTGAGGTGGAAGAAAAAATTGAGCAGCTCCACCAGGAACAGGCCAACAA 

TGAGCTCCTGAGTGACTACATTCGCCTCCTGGCCATAGTCCGCTGGGAGTCTCGGGTGACTCAATATGAA 
AGGGACTTCGAGAGGATTTCAAGAGTGGTCCGAAA^ 
ACTTCAAGAACCACGTGATCAAGTACCTTGAGACACT^ 
GGAAGCCTTCCTTCCTGAGGCAAAGGCCATCT 

GCrGCCTTTTTATACACTGTCCTCCTCCACCTTGATGGACCCCTAGTGATGCATCCrrGCOT 
TTAACCCCTTCCTCCCTGTCCCCACGACCAACTGTCCCCAGTTACTCTAACCGTTATTTCATTTAGCTTC 
GATATATATTTTCTTACCTAAGAGAATAGTTTCCTGCTTTAAGCAAAAGACCTACAATAGGTGGTGGAAT 
r TATGGGATGGGGTGGAGTATTGATATAAATATATAAATACAAATGTATATTTTTCAGGATGTGGTTTAGG . 
AACTGGGAATAACGTTTTCTGTTACTCCTGATGGTGCCATGAAAAGGTTATGTAATAAAATATTTTAAAA 
TCAAAAAAAAAAAAAAAAAA 

Human SNX1 raRNA sequence - varlO (public gi: 23111031) (SEQ ID NO: 161) 

GGGTGGAAGAAGATGGCGTCGGGTGGTGGTGGCTGTAGCGCTTCGGAGAGACTGCCTCCGCCCTTCCCCG 
GCCTGGAGCCGGAGTCCGAGGGGGCGGCCGGGGGATCAGAACCCGAGGCTGGGGACAGCGACACCGAGGG 
GGAGGACATTTTCACCGGCGCCGCGGTGGTCAGTAAACATCAGTCT 
CCCATGAACAATGGCTCCAAAGAAAATGGGATCC^ 

CAGGGGATGGTATGAATGCATATGTAGCCTACAAAGTTACAACACAGACAAGCTTACCATTGTTCAGAAG 
CAAACAGTTTGCAGTAAAAAGAAGATTTAGTGACTTTCTGGGTCTTTATGAGAAGCTTTCCGAGAAGCAC 
TCTCAGAATGGCTTCATTGTCCCTCCGCCCCCGGAGAAGAGCCTCATAGGGATGACAAAAGTGAAAGTTG 
GGAAGGAAGATTCTTCTTCTGGAGAATTTCTTGAAAA 

GATTGTAAATCATCCTACCATGTTACAGGACCCTGACGTCAGAGAGTTCTTGGAAAAAGAAGAGCTGCCA 

CGTGCCGTGGGTACCCAGACATTGAGTGGTGCTGGTCTCCTCAAGATGTTCAACAAAGCCACAGATGCCG 

TCAGCyUUU^TGACCATCAAGATGAATGAATC^GACATTTGGTTTGAGGAGAAGCTCCAGGAGGTAGAGTG 

TGAGGAGCAGCGCTTACGGAAACTGC^TGCTGTTGTAGAAACTCTAGTCAACCATAGGAAAGAGCT 

CTGAAC^CAGCCCAGTTTGCAAAGAGTCTAGCCATGCTTGGGAGCTCTGAGGACAACACGGCATTGTCAC 

GGGCACTCTCCCAGCTGGCTGAGGTGGAAGAAAAAATTGAGCAGCTCCACC^ 

CTTCTTCCTCCTTGCTGAGCTCCTGLAGTGACTACATTCGCCTCCTGGCCATAGTCCGCGCTGCCTTCGAC 
CAGCGCATGAAGAC^TGGCAGCGCTGGCAGGATGCCC^AGCCACACTGCAGAAGAAGCGGGAGGCCGAGG 
CTCGGCTGCTGTGGGCCAACAAGCCTGATAAGCTGCAGCAGGCCAAGGACGAGATCCTCGAGTGGGAGTC 
TCGGGTGACTCAATATGAAAGGGACTTCGAGAGGATTTCAACAGTGGTCCGAAAAGAAGTGATACGGTTT 
GAGAAAGAGAAATCCAAGGACTTCAAGAACCACGTGATCAAGTACCTTGAGACACTCCTTTACTCACAGC 
AGCAGCTGGCAAAGTAGTGGGAAGCCTTCCTTCCTGAGGC^ 

CAGAGCCCACCTGTGTGACGCTGCCTTTTTATACACTGTCCTCCTCCACCTTGATGGACCCCTAGTGATG 
CATCCTGCCTAGGCTGGACTTAACCCCTTCCTCCCTGTCCCCACGACCAACTGTCCCCAGTTACTCTAAC 
CGTTATTTCATTTAGCTTCCATATATATT^ 

CTACAATAGGTGGTGGAATTATGGGATGGGGTGGAGTATTGATATAAATATATAAATACAAATGTATATT 
TTTCAGGATGTGGTTTAGGAACTGGGAATAACGTTTTCTGTTACTCCTGATGGTGCCATGAAAAGGTTAT 
GTAATAAAATATTTTAAAATCAAAAAAAAAAAAAAAAAA 

Human SNX1 protein sequence - varl (public gi: 23 111032) (SEQ ID NO: 281) 
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14ASGGGGCSASBRI*PPPFPGIjEPBSEGAAGGSBPEAGDSDraGEDIFTGAAVVSKHQSPKITTSLLPINN 
GS KENGIHEEQDQEPQDIiFAGDGMNAYVAYKVTTQTSIiPLFRS KQFAVKRRFSDFLGIjYEKIiSEKHSQNG 
FIVPPPPEKSLIGl^KVKVGKEDSSSAEFIiEKRRAAIiERYI^ 
TQTLSGAGLLKMFNKATDAVS KMTI KMNESDI WFEEKLQEVECEEQRLRKLHAVV^ 

QFAKSLAMLGSSEDNTALSRAIiSQIiAEVEEKI EQLHQEQANNDFFLI1AEI1LSDYI RLLATVRAAFDQRMK 

TWQRWQDAQATLQKKREAEARLLWANKPDKIjQQAKDEILEWESRVTQYE 

SKDFramVIKYLETIiLYSQQQIiAKYWEAFLPEAKAIS 

Human SNX1 protein sequence - var2 (public gi: 23111036) (SEQ ID NO: 282) 

MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDI FTGAAWSKHQS PKITTSIjIjPINN 
GSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQKKVIiAKTLI SLP PQEATNSSKPQPTYEEIjEEEEQE 
DQFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSK^ 

PPEKSLIGMTKVKVGKEDSSSAEFIiEKRRAALERYLQRIVNHPTMLQDPDVREF 
GAGIxLKMFNKATDAVSKMTIKMNESDIWFEEKI^ 
IiAMLGSSEDOTALSRALSQIiAEVEEKIEQIiHQEQANNDFFIjIiM^ 
RISTVVRKEVIRFEKEKSKDFKNHVIKYXiETLLYSQQQLAK™ 

Human SNX1 protein sequence - var3 (public gi: 12653179) (SEQ ID NO: 283) 

MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPKITTSLLPINN 
<3SKENGIHEEQDQEPQDLFADATVELSLDSTQNNQK3<^^ 
IXSFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSKQFAVK^ 
PPEKSLIGMTKVKVGKEDSSSAEFLEKRRAALERYLQRIVNHPTMI^DPDVR^ 

GAGLLKMFNKATDAVS KMTI KMNE SD I WF EE KLQEVE CE EQRLRKLHAWE TLVNHRKEIiALNTAQ FAKS 
I1AMLGSSEDNTALSRAI1SQLAEVEEKIEQLHQEQ 
QDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQ^ 
NHVI KYLETLLYSQQQLAKYWEAFLPEAKAI s 

Human SNX1 protein sequence - var4 (public gi: 3453542 3) (SEQ ID NO: 284) 

MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPKITTSLLPINN 

GSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQKKVIAKrLISLPPQEATNSSKPQPTYEELEEEEQE 

DQFDLTVGITDPEKIGDGMNAYVAYKVTTO^ 

PPEKSLIGMTKVKVGKEDSSSAEFLEKRRAALERYIiQRIVNH^ 

GAGLLKMFNKATDAVSKMTIKMNE^ 

IiAMLGSSEDNTALSRAIiSQIiAKTOEKIEQLHQEQA^DF 

QDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQ^ 

NHVIKYLETLIiCSQQQAGEQIiGIRSGILLTKKLPRYSKFFSTVHKFCAAASLWKW 

Human SNX1 protein sequence - var5 (public gi: 3152942) (SEQ ID NO: 285) 

MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDIFTGAAVVSKHQSPKITTSLIjPINN 

GSKENGIHEEQIXiEPQDLFAGDGMNAYVAYKVTTQTSLPLFRSKQFAVKR^ 

FIVPPSPEKSLIGMTKVKVGKEDSSSAEFLEKRRAAIjERYIOR^ 

TQTLSGAGLLKMFNKATDAVSKMTIKMNESDIWFEEKIiQEVECEEQRIjRK^ 

^FAKSIJMLGSSEDNTALSRALSQLAEVEEKIEQLHQEQA*^ 

TWQRWQDAQATLQKKREAEARLLWANKPDKLQQAKDEILEWESRVTQYERDFERIS 

SKDFKNHVI KYLETIiLYSQQQLAKYWEAFLPEAKAI S 

Human SNX1 protein sequence - var6 (public gi: 3152940) (SEQ ID NO: 286) 

MASGGGGCSASERLPPPFPGLEPESEGAAGGSEPEAGDSDTEGEDI FTGAAWSKHQS PKITTSLLP INN 

GSKENGIHEEQDQEPQDLFADATVELSLDSTQNNQKKVLAKTLISLSPQEATNSSKPQPTYEE 

DQFDLWGITDPEKIGDGMNAYVAYKVTTQTSI1PI1FRSKQFAVKRRFSDFLGLYEKI1SEKHSQNGFIVPP 

SPEKBLIGMTKVKVGKEDSSSAEFLEK3^AALERYLQRIVNHPTMLQDP 

GAGLLKMFNKATDAVSKMTIKMNES 

LANrLGSSEDNTAIiSRADSQLAEVEEKIEQL^^ 

QDAQATLQKKREAEARLLWANKPDKI^ 

NHVIKYLETTiliYSQQQLAKYWEAFLPEAKAIS 



Unigene Name? SNX3 Unigene ID; Hs. 12102 

Human SNX3 mRNA sequence - varl (public gi: 23111040) (SEQ ID NO: 162) 
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CTGTTTGCGACCCCGAGTCCCATGACA 

TCX3GCCGGGCCTCCCXKK3AGCCGGGCGTGGCGTTCCAGCTAGTGAGC 

CGGAAGCTTGAGGGGCGCGGGGAGGAGCTTCGCGTGCGGGGTGAACGCC 

TCGCGACCGCTGCGCGCGAGCCCCGTGTCCCCACGG^ 

GAGGGGGCGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAG CGAAA.TGGCGGAGACCGTGGCTGACACCC 
GGCGGCTGATCACCAAGCCGCAGAACCTGAATGACG 
TGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCT^ 
GTAGTTCCCCCGCTCCCTGGGAAAGCGTTTTTGCGT 

ATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAACAA 

GGCACAGAACGAACGTTGTCTTCA.CATGTTTTTACAAGATGAA^ 

AAAATAAGACA.TGCCTGAAATTTGGCAAGAAGGGGCAAAAAOT 

TGAAGAAGTTCTAACTTTTAGGATGCTGCACAGAAACTGG 

TATGCTCAGTTTTGTTTTGTTTTGGCAGTTGACAAGAAGTTAATTTGCTTTA 

AGCCTTTCTATATAAATAGCTCTTTCTTGCTGTTTTAATGTGGTGC^ 

ATTCCAGTGTAATCTGCAGTGTCGTAACTAAAGTTACTGGCTTGGTCTTATTTC 

TGTTTGCTTCTTGCATCTGATTAACTAGAATATTTCTCTTTCCCCCTTTTAATTTGTGATG 

CCCATTTATGTGTAGGAGCACTACACCATTGGTTTCCAATACTGCACACATAAGA 

AGAAAGTATCTTCCTCCAGGCTTGTAATACCCT 

GTATAAAAACAAAAGC?yUVTTTATATACTAAAATCATTTGTCTAAAAATTT 
ATTAAAATGCATTTOTGATATGCAAAAAAAAAAAAAAAAAAAAAA 

Human SNX3 raRNA sequence - var2 (public gi: 3430437 5) (SEQ ID NO: 163) 

GTCCGGCCGGAACCTGTTTGCGACCCCGAGTCCGM 

CCCCTCCCCAGCCTCGGCCGGGCCTCCCGGGAGCCGGGCGTGGCGTTCCAGCTAGTGAGCCGTTTCTCCC 
CTGGGCTCGGAGGCGGAAGCTTGAGGGGCGCGGGGAGGAGCTTCGCGTGCGGGGTGAACGCCCGCTCTAC 
GTGCTCGTTC^CTTCGCGACCGCTGCGCGCGAGCCCC^ 

GTOGCTGAACGCGGAGGGGGCGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACC 

GTGGCTGACACCCGGCGGCTGATCACCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGC^ 

TCCTCGAGATCGATGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCACCACTTACGAAAT 

CAGGGTCAAGACAAATCTTCCTATTTTCAAGCTGAAAGAAT 

GAATGGCTGCGAAGTGAATTAGAAAGAGAGAGCAAGCCCTGCCTCAGAATGA 

ATGGAAGGACGTGGTGTGCTCAGAATGATGAAAAGTTATTTTGTGACTAGAAAGTCGTAGTTCCCCCGCT 

CCCTGGGAAAGCGTTTTTGCGTC^GCTTCCTTTTAGAGGAGATGATGGAATATTTGATGACT^TTTTATT 

GAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAACAAGGTCGCTGGTCATCCTCTGGCACAGAACGAAC 

GTTGTCTTCACATGTTTTJACAAGATGAAATAATAGATAAAAGCTATACTCCATCTi^AATAAGACATGC 

CTGA^TTTGGCAAGAAGGGGCAAAAACGTGACTATTAATGATTGATAAGCACCAGTGAAGAAGTTCTAA 

CTTTTAGCATGCTGCACAGAAACTGGTATAACATGCCTTCAGTAT^ 

TTTTGTTTTGGCAGTTGACAAGAAGTTAATTTGCTTTAGT^ 

AATAGCTCTTTCTTGCTGTTTTAATGTGGTGCACACTATAGCCTCACAAACCTGTTATTCCAG 
TGCAGTGTCGTAACTAAAGTTACTGGCTTGGTCTTATTTGCACAGTTTTTGCGTCTTGTTTGCTTCTTGC 
ATCTGATTAACTAGAATATTTCTCTTTCCCCCTTTTAATTTGTGATGTCACTTGACCCCATTTATGTGTA 
GGAGCACTACACCATTGGTTTCCAATACTGCACACATJ^ 

TCCAGGCTTGTAATACCCTTCACATGGAAGATTAATGAGGGAAATCTTTATATTCTGTATAAAAACAAAA 
CTGATATGCAAAAAAAAAAAAAAAAAAAAAAA. 

Human SNX3 raRN A sequence - var3 (public gi: 34190889) (SEQ ID NO: 164) 

TCGACCCACGCGTCCGCCCACGCGTCCGCTGTTTGCGACCCCGAGTCCCATGACACCGCTTCTCCTCACA 
CCCCAGTCCGCAGTGCCCCTCCCCAGCCTCGGCCGGGCCTCCCGGGAGCCGGGCGTGGCGTTCCAGCTAG 
TGAGCCGTTTCTCCCCTGGGCTCGGAGGCGGAAGCTTGAGGGGCGCGGGGAGGAGCTTCGCGTGCGGGGT 
GAACGCCCGCTCTACGTGCTCGTTCTCTTCGCGACCGCTGCGCGCGAGCCCCGTGTCCCCACGGCGGGCA 
GCAGCGGCGGCGGCGGCGGCTGAACGCGGAGGGGGCGGAGGGAGCCCGCGGCX3GCGGCAGCAGCTACAGC 
GAAATGGCGGAGACCGTGGCTGACACCCGGCGGCTGATCACCAAGCCGCAGAACCT 

GACCCCCCAGCAACTTCOTCGAGATCGATGTGAGCAACCCGGAAACGGTGGGGGTCGGCCGGGGCCGCTT 

CACGACTTACGAAATCAGGGTCAAGACAAATCTTCCTATTTTCAAGCTGAA^ 

AGATACAGTGACTTTGAATGGCTGCGAAGTGAATTAGAAA^^^ 

CAGAGGCAA.GGAGTCATGGAAGGACGTGGTGTGCTCAGAATGATGAAAAGTTATTTTGTGACTAGAAAGT 

CGTAGTTCCCCCGCTCCCTGGGAAAGCGTTTTTGCGT.CAGCTTCCTTTTAGAGGAGATGATGGAATATTT 

GATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAACAAGGTCGCTGG 

TGGCACAGAACGAACGTTGTCTTGA.CATGTTTTTACAAGATGAAATA^ 

TAAAATAAGACATGCCTGAAATTTGGCAAGAAGGGGCAAAAAC^ 

GTGAAGAAGTTCTAACTTTTAGCATGC.TGCACAGAAACTGGTATAACATGCCTTCAGTATACTAACACTC 



Figure 36 part -91 



136/202 



WO 2004/078130 



PCT/US2004/006308 



ATATGCTCAGTTTTGTTTTGTTTTGGCAGTTGACAAGAAGTTAATTTO 

CAGCCTTTCTATATAAATAGCTCTTTCTTGCTGTTTTA 

TATTCCAGTGTAATCTGCAGTGTCXSTAAOTAAAGTTACTGGC^ 

TTGTTTGCTTCTTGCATCTGATTAACTAGAATATTTCTCTTTCCCCCT 

CCCC^TTTATGTGTAGGAGCACTACAC 

CAGAAAGTATCTTCCTCCAGGCTTGTAATACCC^ 

TGTATAAAAACAAAAGCAAATTTATATACT^ 

AATTAAAATGCATTTCTGATATGCAAAAAAAAAAAAAAAAAAAAAAA 

Human SNX3 mRNA sequence - var4 (public gi: 15779011) (SEQ ID NO: 165) 

GGGGCTTCGCGACCGCTGCGCGCGAGCCCCGTGTCCCCACGGCGGGC^GCAGCGGCGGCGGCGGCGGCTG 
AACGCGGAGGGGGCGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTG 
ACACCCGGCGGCTGATCACCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGCAACTT 
GATCGATGTGAGGAACCCGCAAACGGTGGGGGTC^ 

AAGACAAATCTT CCTATTTTCAAG CTGAAAGAATCTACTGTTAGAAGAAGATACAGTGACTTTGAATGGC 

tgcgaagtgaattagaaagagagagcaaggtcgtagttcccc^ 

gcttccttttagaggagatgatggaatatttgatgacaattttattgaggaaaga 

cagtttataaacaaggtcgctggtcatcctctggc^ 

atgaaataatagataaaagctatactccatctaaaataagacatgcctgaaatttggcaagaaggggc^ 

aaacgtgactattaatgattgataagcacgagtgaagaagttctaacttttagcatgct 

ggtataacatgccttcagtatactaacactcatatg 

gttaatttgctttagtaaaaatccctrcact 

tgtggtgcacactatagcctcacaaacctgttattccagtgtaatcrrgcagtgtcgtaactaaagttact 

ggcttggtcttatttgcacagtttttgcgtcttgtttgcttcttgc^tctgattaactagaatatttctc 

tttcccccttttaatttgtgatgt(^cttgaccccatttatgtgtaggagcacta(^cc^ttggtttcca 

atactg(^cac^taagatacatacttgtgtgcagaaagtatcttcctccaggcttgtaatacccttcaca 

tggaagattaatgagggaaatctttatattctgt^^ 

tgtctaaaaatttaagttgttttcaaataaaaattaaaato 

AA 

Human SNX3 mRNA sequence - var5 (public gi: 15929496) (SEQ ID NO: 166) 

cgcgcgagccccgtgtccccacggcgggcagcagcggcggcggcggcggctgaacgcggagggggcggag 

ggagcccgcggcggcggcagcagctacagcgaaatggcggagaccgtggctgacacccggcggctgatca 

ccmgccg<^gaacctgaatgacgcctacggaccccccagcaacttcctcgagatcgatgtgagcaaccc 

g<3^aacggtgggggtcggccggggccgcttcaccact 

ttcmgctgaaagaatctactgttagaagaagatacagtgactttgaatggctgcgaagtgaattagaaa 

gagagagcaaggtcgtagttcccccgctccctGtGgaaagcgttt 

tgatggaatatttgatgacaattttattgaggaaagaaaacaagg 

GCTGGTC^TCCTCTGGCA(^GAACGAACGTTGTCTTCACATGTTTTTAC^UVGATGAAATAATAGATAAAA 

gctatactccatctaaaataagacatgcctgaaatttggcaagaaggggcaaaaacgtgactattaatga 
ttgataagcaccagtgaagaagttctaacttttagcatgctgcacagaaactggtataacatgccttcag 

TATACTAACACTCATATGCTCAGTTTTGTTTTGT^ 

AAATCCCTCATTCCAGCCTTTCTATATAAATAGCTCTTTCT^ 

CTCACAAACCTGTTATTCCAGTGTAATCTGCAGTGTC 

GA.GTTTTTGCGTCTTGTTTGCTTCTTGCATCTGATTAACTAGAATATTTCTCT 

TGATGTCACTTGACCCCATTTATGTGTAGGAGCACTACACCATTGGTTTCCAATACTGCACACATAAGAT 
ACATACTTGTGTGCAGAAAGTATCTTCCTCCAGGCTTGTAATACCCTTCACATGGAAGATTAATGAGGGA 
AATCTTTATATTCTGTATAAAAACAAAAGCAAATTTATATACTAAAATCATTTGTCTAA 
GTTTTCAAATAAAAATTAAAATGCATTTCTGATATGCAAAAAAAAAAAAAAA 

Human SNX3 mRNA sequence - var6 (public gi: 14250078) (SEQ ID NO: 167) 

AGCCCCGTGTCCCCACGGCGGGCAGCAGCGGCGGCGGCGGCGGCTGAACGCGGAGGGGGCGGAGGGAGCC 
CX3CGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTGACACCCGGCGGCTGATCACCAAGC 
CGCAGAACCTGAATGACGCCTACGGACCCCCCAGCAACTTCCTCGAGATCGATGTGAGCAACCCGCAAAC 
GGTGGGGGTCGGCCGGGGCCGCTTCACCACTTACGAAATCAGGGTCy^G 

CTGAAAGAATCTACTGTTAGAAGAAGATACAGTGACTTTGAATGGCTGCGAAGTGAATTAGAAAGAGAGA 

GCAAGGTCGTAGTTCCCCCGCTCCCTGGGAAAGCGTTTTTGCGTCAGCTTCCTTTTAGAGGAGATGATGG 

AATATTTGATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAACAAGGTCGCTO 

CATCOTCTGGCACAGAACGAACGTTGTCTTC^CATGTTTTTACAAGATGAAATAATAGATAAAAC^ 

CTCCATCTAAAATAAGACATGCCTGAAATTTGGC^GAAGGGGC^AAAACGTGACTATTAATGATTGATA 

AGCACCAGTGAAGAAGTTCT7VACTTTTAGCATGCTGCACAGAAACTGGTATAACATGCCTTCAGTATACT 

AACACTCATATGCTCAGTTTTGTTTTGTTTTGGCAG 



Figure 36 part -92 



137/202 



WO 2004/078130 



PCT/US2004/006308 



CTCATTCCAGCCTTTCTATATAAATAGCT 
AACCTGTTATTCCAGTGTAATCTGCAGTGTCGTAACT 
•TTGCGTCn'TGTTTGCTTCTTGCATCTGATTAACrAGAAT 
CACTTGACCCCATTTATGTGTAGGAGCACTA^ 

TTGTGTGCAGAAAGTATCTTCCTCCAGGCTTGTAATACCCTTCACATGGAAGATTAATGAGGGAAATCTT 

TATATTCTGTATAAAAACAAAAGCAAATTTATATACTAAAATCATTTGTCTAAAAATTTA 

AAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SNX3 mRNA sequence - var7 (public gi: 12957159) (SEQ ID NO: 168) 

GGGCGAGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGG 

CGGCTGATCACCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGCAACTTCCTC 

TGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCT 

AGTTCCCCCGCTCCCTGGGAAAGCXjTTTTTGCGTCAGCTTCCITTTAGAG 

GACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTT^ 

(^(^GAACGAACGTTGTCTTCACATGTTTT^ 

AATAAGACATGCCTGAAATTTGGCAAGAAGGGGCAAAAACGTGACTATTAATGATTGATAA 
AAGAAGTTCTAACTTTTAGCATGCTGCACAGAAACTGGTATAACATG 

TGCTCAGTTTTGTTTTGTTTTGGCAGTT'GACAAGAAGTTAATTTGCTTTAGTAAAAATCCCTC^ 
CCTTTCTATATAAATAGCTCTTTCTTGCTGT^ 

TCCAGTGTAATCTGCAGTGTCGTAACTAAAGTTACTGGCTTGGTCTTATTTGCA 

TTTGCTTCTTGCATCTGATTAACTAGAATATTTCTCTTTCCCCCTTTTA 

GATTTATGTGTAGGAGCACTACACC^TTGGT^ 

AAAGTATCTTCCTCCAGGCTTGTAATACCCTTGACATGGAAGATTAATGAGGGAAATCTTTATATTCTGT 
ATAAAAACAAAAGCAAATTTATATACTAAAATCA^ 

TAAAATGCATTTCTGATATGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SNX3 niRNA sequence - var8 (public gi: 34304374) (SEQ ID NO: 169) 

GTCCGGCCGGAACCTGTTTG03ACCCCGAGTCC 

CCCCTCCCCAGCCTCGGCCGGGCCTCCCGGGAGCCGGGCGTGGCGTTCCAGCTAGTGAGCCGTTTCTCCC 

CTGGGCTCGGAGGCGGAAGCTTGAGGGGCGCGGGGAGGAGCTTCGCGTGCGGGGTGAACGCC 

GTGCTCGTTCTCTTCGCGACCGCTGCGCGCGAGCCCC^^ 

GCGGCTGAACGCGGAGGGGGCGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACC 
GTGGOTGACACCCGGCGGCTGATCACCAAGCCGCAGAACCTGAATC^ 

TCCTOSAGATCGATGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCACCACTTACGAAAT 

C^GGGTCAAGACAAATCTTCCTATTTTCAAGCTGAAAGAATCTACTGTTAGAAGAAGATAC^GTGACTTT 

GAATGGCTGCGAAGTGAATTAGAAAGAGAGAGC^GGTCGTAGTTCCCCCGCTCCCTGGGAAAGCGTTTT 

TGCGTCAGCTTCCTTTTAGAGGAGATGATGGAATATTT 

GCTGGAGC^GTTTATAAAGAAGGTCGCTGGTCATCCTCTGGCA 

TTACAAGATGAAATAATAGATAAAAGCTATACTCCATCTAAA^ 

GGGGCAAAAACGTGACTATTAATGATTGATAAGCACCAGTGAAGAAGTTCTAACTTTTAGCATGCTGCAC 
AGAAACTGGTATAACATGCCTTCAGTATACTAACACTCATATGCTCAGTTTTGTTTTGTTTTGGCAGTTG 
ACAAGAAGTTAATTTGCTTTAGTAAAAATCCCTCATTCCAGCCT 

GTTTTAATGTGGTGC^CACTATAGCCTCACAAACCTGTTATTCCAGTGTAATCTGCAGTGTCGTAACTAA 
AGTTACTGGCTTGGTCTTATTTGCZACAGTTTTTGCGTCTTGTTTGCTTCTTGCATCTGATTAACTAGAAT 
ATTTCTCTTTCCCCCTTTTAATTTGTGATGTC^CTTGA 

GTTTCCT^TACTGCACACATAAGATACATACTTGTGTGCAGAAAGTATCTTCCTCCAGGCT 
CTTCACATGGAAGATTAATGAGGGAAATCTTTATATTCTGT^ 
AATCATTTGTCTAAAAATTTAAGTTGTTTTCAAATA 
AAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SNX3 mRNA sequence - var9 (pubHc gi: 3058306 6) (SEQ ID NO: 170) 

ATGGCGGAGACCGTGGCTGACACCCGGCGGCTGATCACCAAGCCGCAGAACCTGAATGACGCCTACGGAC 
CCCCCAGCAACTTCCTCGAGATCGATGTGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCAC 
CACTTACGAAATCAGGGTCAAGACAAATCTTCCTATTTTCAAGCT 

TACAGTGACTTTGAATGGCTGCGAAGTGAATTAGAAAGAGAGAGCAAGGTCGTAGTTCCCCCGCTCCCTG 
<3GAAAGCGTTTTTGCGTCAGCTTCCTTTTAGAGGAGATGATGGAATATTTGATGACAATTTTA 
AAGAAAACAAGGGCTGGAGCAGTTTATAAACAAGGTCGCTGGTCATCCTCTGGCACAGAACGAACGTTGT 
CTTCACATGTTTTTACAAGATGAAATAATAGATAAAAGC^ 

Human SNX3 mRNA sequence - varlO (public gi: 3127052) (SEQ ID NO: 171) 

OGGCGAGGAGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTGACACCCGG 
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CGGCTGATCACCAAGCCGCAGA&CCTG 

TGAGCAACCCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTC^CCACTTA 

TCTTCCTATTTTCAAGCTGAAAGAATCTACTGOT^ 

GAATTAGAAAGAGAGAfiCAAGGTCX»TAGTTCCCCCGCTCCCTGGGAA 

TTAGAGGAGATGATGGAATATTTGATGAC^TTTT 

AAACAAGGTCXK2TGGTCATCCTCTGGCACAGAACGAACGTTG 

ATAGATAAAAGCTATACTCCATCTAAAATAAGAC^TGCCTGAAATTJGGCAAGAA 

CTATTAATGATTGATAAGCACCAGTGAAGAAGTTCTAACTTTTAGCATC 

ATGCCTTCAGTATACTAACACTCATATGCTCAGTTTTC 

GCTTTAGTAAAftATCCCTCATTCGAGCCTTTC^ 

ACACTATAGCCTCACAAACCTGTTATTCCAGTGTAATCTGCAGTC 

CTTATTTGCACAGTTTTTGCGTCTTGTTTGCT 

TTTTAATTTGTGATGTCACTTGACCCCATTTATGTGTAG 

CAC^TAAGATACATACTTGTGTGCAGAAAGTATCTTCCT^ 

TAATGAGGGAAATCTTTATATTCTGTATAAAAACAAAAGCAAATTTATATACTAAAATCATT^ 
AATTTAAGTTGTTTTCAAATAAAAATTAAAATG C^TTTCTGATATGCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAA 

Human SNX3 mRNA sequence - varl 1 (public gi: 3126978) (SEQ ID NO: 172) 

GCGGCACAGCTACAG03AAATGGCGGAGACCGTGGCTGA 

CTGAATGACGCCTACGGACCCCCCAGCAACTTCCTCGAGATCGATGTGAGCAACCCGCAAACGGTGGGGG- 

TCGGCCGGGGCCGCTTCACCACTTACGAAATCAGGGTCAAGACAAATCT^ 

ATCTACTGTTAGAAGAAGATACAGTGACTTTGAATGGCTGCGAAGTGAAT^^ 

GTAGTTCCCCCGCTCCCTGGGAAAGCGTTTTTGCGTCACTTCCCTTTTAGAGGAGATGATGGAATATTTG 

ATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAACAAGGTCGCTGGTCA 

GGCACAGAACX5AACGTTGTCTTCACATGTTTT^^ 

AAAATAAGACATGCOTGAAATTTGGCAAGAAGGGGCAAAAACCGTGA^ 

GTGAAGAAGTTCTAACTTTTAGCATGCTGCACAGAAACTGGTATAACATGCCTTCAGTATACTAACACrC 

CATATGCTCAGTTTTGTTTTGTTTTGGCAGTTGACA&GAAGTTAATTTGCT 

CCAGCCTTTCTATATAAATAGCTCCTTCCT 

GTTAATCCAGTGTAATCTGCAGTGTCGTAACTAAAGTACTGGCTTGGTCCTAATTG 

Human SNX3 protein sequence - varl (public gi: 23111041) (SEQ ID NO: 287) 

MAETVAI^TRRIiI TKPQNLNDAYGP PSNFLE IDVSNPQTVGVGRGRFTTYE I RVKWVPPLPGKAFLRQLP 
FRGDDGIFDDNFI EERKQGLEQFINKVAGHPIAQNERCLHMFLQDEI IDKSYTPSKIRHA 

Human SNX3 protein sequence - var2 (public gi: 23111043) (SEQ ID NO: 288) 

•MAETVADTRRLITKPQNLNDAYGPPSNFLE IDVSNPQTVGVGRGRFTTYE I RVKTNL P I FKLKESTVRRR 
YSDFEWLRSELERESKPCLRMTSEARSHGRTWCAQNDEKLFCD 

Human SNX3 protein sequence - var3 (public gi: 15779012) (SEQ ID NO: 289) 

KAETVADTRRLITKPQNLl^AYGPPSNFIjEIDVSNPQTVGVGRGRFTTYEI RVKTNLP I FKLKESTVRRR 

YSDFEWLRSELERESKWVPPLPGKAFl^QLPFRGDDGIFDDNFIEERKQ^ 

LHMFLQDEI IDKSYTPSKIRHA 

Human SNX3 protein sequence - var4 (public gi: 3126979) (SEQ ID NO: 290) 

MAETVADTRRLI TKPQNLNDAYGP PSNFLE IDVSNPQTVGVGRGRFTTYE IRVKTNLP I FKLKESTVRRR 
YSDFEWLRSELERESKVWPPLPGKAFLRHFPFRGDDGIFDDNFIPERKQGLEQFINKVAGHPLAQNERC 
LHMFLQDEI IDKSYTPSKIRHA 

Human SNX3 pray sequence - varl (SEQ ID NO: 173) 

GCCGCCATGGNAGTACCCATACX3ACGTACGAGATTACGCT 

CCAAGCAGTGGTATCAACGCAGAGTGGCCATTATGGCGGCGGCGGCGGCGGCTGAAC 

AGGGAGCCCGCGGCGGCGGCAGCAGCTACAGCGAAATGGCGGAGACCGTGGCTGACACCCGGCGGCTGAT 

CACCAAGCCGCAGAACCTGAATGACGCCTACGGACCCCCCAGC7VACTTCCTCGAGATCGA 

CCGCAAACGGTGGGGGTCGGCCGGGGCCGCTTCACCACTTACGAAATCAGGGTCAAGACAAATCTTCCTA 

TTTTCAAGCTGT^AAGAATCTACTGTTAGAAGAAGATACAGTGACTTTGAATGGCTGCGAAGTGAATCAGA 

AAGAGAGAGCAAGGTCGTAGTTCCCCNNGCTCCCTGGGAAAGCGTTTTT^ 

AGATGATGGAATATTTGATGACAATTTTATTGAGGAAAGAAAACAAGGGCTGGAGCAGTTTATAAAC^G 

GTCGCTGGTCATCCTCTGGCACAAAACGAACGTTGTCTTCACATGT 

AAAGCTNTACTCCATCTAAAATAAAACATGCCTGAAOT 
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ATTGANAGCCCOSrN*m^ 
CCTNNNTNCCNANTTTNNTTTGNN^^ 

Human SNX3 pray sequence - var2' (SEQ ID NO: 174) 

CCGCCATGGTAGTACCCATACGACGTACC^^ 

CCCAAGCAGTGGTATCAACGCAGAGTGGCCATTATGGCCGGGGC^ 

CCAGCTACAGCXaAAATGGCGGAGACCGTGGCTGACACCCGGCGGCTC 

TGACGCCTACGGACCCCCCAGCAACTTCCTCGAGATCGATGTG 

CGGGGCCGCTTCACCACTTACGAAATCAGGGTCAAGAC^ 

CTGTTAGAAGAAGATACAGTGACTTTGAATGGCTGCX5AAGT 

TCCCCCGCTCCCTGGGAAAGCGTTTTTGCGTCAGCTTNCTTTTAGAGGGGA 

AATTTTATTGAGGAAAGAAAACAAGGGCTGGANCANTTTATNAACAA^ CTATTCCTNAAA 

GTGTANGACTNCTTTAAGTGACrrACTTTTNTTTANATGTNAANNNAACTC^ 

CNTTTCCTANOTTTNATTTNTTTAA 

Unigene Name: SRA1 Unigene ID: Hs . 32587 Clone ID: 3GD_19 
Human SRA1 mRNA sequence - varl (public gi: 10436964) (SEQ ID NO: 175) 

ACGTGAAGCCGGGTGAGCGCAGCCGGCGGGCTAGGGCACTAGGTCGTCGCCCCGGCOT 

GTTGCGGCGCTTAGTATGGACCCTCTGTCTCCCCCAGCCCCAGTATAAGCTAACAGTGGAGTTCCGGGCT 

CGCTTCACACATCCCTCGCCTCCGCAGGCAACAA.GGAACGCGGCTGGAACGA 

CGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGTAGCCGCACCCCAGGAT 

GGATCCCCC^GAGTCCCCGCATCAGAGACTTCTCCTGGGCCTCCCCC^T^ 

GTAAGGTTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCG'^^ 

CGAGTCTGAGGCTCGACTGATGGAGGATGTGCTGAGACCTTTGGAACAGGGATTGGAAGA^ 

CACACAAGGAAGCAGGTATGTGATGACATGAGCCGACGC^ 

GAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGCCACCG 
CGCAGCAGATGACATCCACCGCTCCCT(^TGGTTGACCATGTGACTGAGGTCAGTCAGTGGATGGTAG^ 
GTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAAGAGAAATCTGCAG 
CCACAGCTGAGAAGAACCATACCATACCAGGCTTCCAGCAGGC 

CCGGACACCATCTCCTATGCCTTGGAGACCTTCTGTCACTTGGCTCCCTTCTTACCACCACCAAGACTGT 
CCCACTGGGCCTGACCCACCTATGAGGGAAGAAGTCCCACCTGGGCCAGAGGG^ 

TAACATGCATTTCAATAAAAACATCTCTGCGGTGGGCCTTGGGTAGGAGAGATGAACCCTTCCGGTGCC^ 
AGCTAGTCCCCTCTGGTGTCCTCGACTGCCCTGCTCCCTGTGTATC^^ 

CATTCATCAGGAAGGGATCTGCTGGGTAAAGTCAGACTACTGCCTACCACTTTTTCCCAAAGTAGACTG 

AAGCACATCCTGTGCTGGGCGGAGCAGCTGTGTTTGGATGGTTTCATTTCAGGATGAGAACAGACTCAAA 

TAGAACGGGGAGACTTTTCCCTCAACAAAAGGAAAGACAGTCCT^^ 

CTACTGTTACAGAGATTAGAACCACA.TTGAGTGGGGTTTTCTGTGTAAATCGAAGGAGAAAAAGACCAGA 
TTACTGAGATTGGGGATTGTAACTCTGACTTGCCAAACAAACTC 

Hxunan SRA1 mRNA sequence - var2 (public gi: 9930611) (SEQ ID NO: 176) 

TCCTTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTAGGGAGTTGCC 
GCAGGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATCCTCCACGGATTCCGGCAACCCGC 
CTGGCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCTGGC 
•CAAGCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGC 
CGCAGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTATCAAGAGGGTCGC 
OTCACCCCAGGATGGATCCCCCAGAGTCCCCGC^TCAGAGACTTCTCCTGGGCCTCCCCCAATGGGGCCT 
CCACCTCCTTCAAGTAAGGCTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGCCCA 
CAAGTTTCCC^GTCGAGTCTGAGGCTGTGATGGAGGATGTGCT 

CTGCCGTGGCCACACAAGGAAGCAGGTATGTGATGACATCAGCCGACGCCTGGCACTG 

TGGGCTGGAGGAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGCC 

ACCGGTGGGACGCAGCAGATGACATCCACCGCTCCCTCATGGOT 

<3ATGGTAGGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAAGAG 

AAATCTGCAGCCACAGCTGAGAAGAACCATACCATACCAGGCTO 

CCCAGACT 

Human SRA1 mRNA sequence - var3 (public gi: 9930613) (SEQ ID NO: 177) 

TCCTTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTAGGGAGTTGCC 
GCAGGGTCCCCACAGCXSGCTCCCGACGGTTGTGAACC^GCATCGATCCT 

CTGGCCCTGGACGTGTCTCAACTGGCCCGCGTGJ^GGGGCCGCCCCGGAAATGACGCGCTGCCCCGCTGGC 
CAAGCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGC 
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CGCAGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAG^ 

CGCACCCCAGGATGGATCCCCC^GAGTCCCCGCATC^GAGACTTCTCCTGGGCCTCCCCC^ 

CCACCTCCTTCAAGTAAGGCTCCCAGGTCCCCACCTGTG 

CAMTTTCCCAGTCGAGTCTGAGGCTa^CreATC 

AGACTGCCGTGGCCACACAAGGAAGCAGGTATGTGATGACATC^ 

CAGTGGGCTGGAGGAAAGTTGTCAATACCTGTAAAGAA 

GCCACCGGTGGGACGC^GCAGATGACATCC^CCGCTCCCTCATGGTTGACCATGTGACTGAG 

GTGGATGGTAGGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCT 

GAGAAATCTGCAGCCACAGCTGAGAAGAACCATACCATACCAGGCTTC 

TTCCCCAGACT 

Human SRA1 mRNA sequence - var4 (public gi: 4588026) (SEQ ID NO: 178) 

CGCTTGGCGGAGCTGTACGTGAAGCCGGGCAACAA 

ACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGTA 

TGGATCCCCCAGAGTCCCCGCATCAGAGACTTCTCCTGGGCCTCCCCCAATGGGGCCTCCACCTCCTTCA 
AGTAAGGCTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGC^ 
TCGAGTCTGAGGOTGTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGAC^ 
CACAAGGAAGCAGGTATGTGATGACATGAGCCGACGC^ 

AAGTTGTCAATACCTGTAAAGAA^GAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGC 
CAGCAGATGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCAGTGGATGGTAGGAGT 

TAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTT<^GAG 

ACAGCTGAGAAGAACCATACCATACCAGGCTTCCAGC^GGCTTCATAATCCTCGGTTCCC 
GGACACCATCTCCTATGCCTTGGAGACCTTCTGTCACTTGGCT 

CACTGGGCCTGACCCACCTATGAGGGAAGAAGTCCCACCTGGGCCAGAGGGAGTTCATGTGTTACTCATA 
ACATGCATTTCAATAAAAACATCTCT'GCGGTGGTG 

Human SRA1 mRNA sequence - var5 (public gi: 25123254) (SEQ ID NO: 179) 

GGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACCCCGCCGCAGTTOT 

GCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGTCGCCGCACCC^ 

CCCAGAGTCCCCGCATCAGAGACTTCTCCTGGGCCTGCCCCAATGGGGCCTCCACCTCCTTCAAG 

CTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGCCGAGAAGTTTCCCAGTCGAGTC 

TGAGGCTGTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGACTGCCGTGGCCACACAAGG 

AA.GCAGGTATGTGATGACATCAGCCGACGCCTGGCACTGCTGCAGGAACAGTGGGCTGGAGGAAAGTTGT 

CAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGCCACCGGTGGGACGCAGCAGA 

TGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCAGTGGATGGTAGGAGTTAAAAGA 

TTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAAGAGAAATCTGCAGCCACAGCTG 

AGAAGAACCATACCATACCAGGCTTCCAGCAGGCTTCATAATCCTCGGTTCCCCAGACTCACCGGACACC 

ATCTCCTATGCCTTGGAGACCTTCTGTCACTTGGCTCCCTTCTTACCACCACCAAGACTGTCCCACTGGG 

CCTGACCCACCTATGAGGGAAGAAGTCCCACCTGGGCCAGAGGGAGTTCATGTGTTACTCATAACATGCA 

TTTCAATAAAAACATCTCTGCGGTGAAAAAAAAAAAAAAA 

Human SRA1 mRNA sequOT.ce - var6 (public gi: 18027813) (SEQ ID NO: 180) 

GCAGGCACTAAGCTGGGCACTGGGAATGTAATAAAATAGTCAAGGTCCCACCTTCTAAGACTGTCCGACA 
GGGAAACGAACAAGAGTCAAATAAGGCAGAAGATGTGATGTAATACACCTACGAAATCTCAGAGGGTTGT 
AGGGTCGTGGGAGCTCAAGTGAGACACTTAACCTGGCCTGAGACATTCCAGAAGGCCTCCTGAAGAACTG 
ACATCTGAACTGAGAACTGAAGGAAGATGAGTACTAGTGAGGCTACCGGACGTGAATGTGGAGATTGTGC 
AGGGC^VATGCAAGAGGAGGCTGTAGAAGTCAACCTGGCTAGATCACAGCGGGGTGTATGTGGGGCAGGAG 
CTTCTTTGTTTGAATTTGCTCCTGAGAGGAT<^GG 

TTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTAGGGAGTTGCCGCA 
GGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATTCTCCACGGATTCCGGCAACCCGCCTG 
GCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCTGGCCAA 
GCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGCCGC 
AGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGTAGCCGC 
ACCCCAGGATGGATCCCCCAGAGTCCCCX5CATCAGAGACTTCTCCTGGGCCTCCCCCAATGGGGCCTCCA 
CCTCCTTCAAGTAAGGCTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGCCCACAA 
GTTTCCCAGTCGAGTCTGAGGCTCGACTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGA 
CTGCCGTGGCCACACAAGGAAGCAGGTATGTGATGACATCAGCCGACGCCTGGCACTGCTGCAGGAACAG 
TGGGCTGGAGGAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGCC 
ACCGGTGGGACGCAGC7VGATGAGATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCAGTG 
GATGGTAGGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAAGAG 
•AAATCTGCAGC CACAG CTGAGAAGAACCATACCATACCAGG CTT C CAGCAGGCTTCATAATCCTCGGTTC 
CCCAGAOTCACCGGACACCATCTCCTATGCCTTGGAGACCTTCTGTCACTTGGCrCCCTTCTTACCACCA 



Figure 36 part - 96 



141/202 



WO 2004/078130 PCT/US2004/006308 

CCAAGACTGTCCCA£TGGGCC 

GTGTTACTCATAACATGCATTTCAATAAAAACAT^ 

Human SRA1 mRNA sequence - var7 (pubUcgi: i6549596)(SEQlDNO: 181) 

TTATAGCAAAATCAGTGCAAATAAAAATCCCTCAGTGACCTC^ 

CAGGGCTGGGGGCTAACACCCTGTGTGAGATGAGTGTCTTTGTGT 

GTAGTCACATGACAGCATGGGTGTGATGGAGATCTGACT 

CCTGTGCCAGGCACTAAGCTGGGCACTGGGAATGTAATA^ 

CCGACAGGGAAACGAACAAGAGTCAAATAAGGCAGAAGATGTGATGTAATA 

GGTTGTAGGGTCX5TGGGAGCTCAAGTGAGACACTTAACCTGGCCTGAGAC^ 

GAACTGACATCTGAACTGAGAACTGAAGGAAGATGAGTACT^^ 

TTGTGCAGGGCAATGCAAGAGGAGGCTGTAGAAGTCAACCTGGCTAGATCACAGCGGGGTGTAT 
CAGGAGCTTCTTTGTTTGAATTTGCTCCTGAGAGGATGAGGCCTCCT 
ACCTCCTTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGT 
GCCGCAGGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCA 

CGCCTGGCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCT 
GGCC^GCXSGAAGTGGAGATGGCGGAGCTGTACGTGAAGCC^ 

CGCCGCAGTTCTCATACGGGCTGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCr^ 

AGCCGCACCCCAGGATGGATCCCCCAGAGTCCCCXSC^^ 

CCTCCACCTCCTTCAA.GTAAGGCTCCCA.GGTCCCCACCTGTGGGGAGT 

CCACAAGTTTCCCAGTCGAGTCTGAGGCTrCGACTGATGGAGG 

GGAAGACTGCCGTGGCGACACAAGGAAGCAGGTATGTGATGAGAT 

GAACAGTGGGCTGGAGGAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTT 
CAAGCCACCGGTGGGACGCAGC^GATGACATCCACCGCTC^ 

TCAGTGGATGGTAGGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTC^GAGGAGGCAGCCAAT 
GAAG AGAAATCTGCAG C CACAGCTGAGAAGAACCATACCATAC CAGGCTT CCAGCAG GCTTCATAATCCT 
CGGTTCCCCAGACTCACCGGACACCATCCCCTATGCCTTGGAGACCTTCTGTCACTTGGCTCCCTTCTTA 
CCACCACCAAGACTGTCCCACTGGGCCTGACCCACCTATGAGGGAAGAAGTCCCACCTGGGCCAGAGGGA 
GTTCATGTGTTACTCATAACATGCATTT 

AACCCTTCCGGTGCCAAGCTAGTCCCCTCTGGTGTCCTCGACTGCCCTGCTCCCTGTGTATCTGCAAACC 

TCTGTTCTCCCTTCTCCATTCATCAGGAAGGGATCT^ 

TCCCAAAGTAGACTGAAAGGAXIATCCTGTGCT 

TGAGAACAGACTGAAATAGAACGGGGAGACTTTTCCCTCAACAAAAGGAAAGACA 
TATCACCCTTGAGATACTACTGTTACAGAGATTAGAACC 

Human SRA1 mRNA sequence - var8 (public gi: 9930609) (SEQ ID NO: 182) 

TCCTTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTAGGGAGTTGC 

GCAGGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATCCTCCACGGATTCCGGCAACCCGC 

CTGGCCCTGGACGTGTCTCZAACTGGCCCGCGTGAGGGGCCGCCCCGGAAATGACGCGCTGCCCCGCTGGC 

CAAGCGGAAGTGGAGATGGCGGAGCTGTACGTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGC 

CGCAGTTCTCATACGGGCrGCAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGTAGC 

CGCACCCCAGGATGGATCCCCCAGAGTCCCCGCATCAGAGACTTCTCCTGGGCCTCCCCCAATGGGGCCT 

CCACCTCCTTCAAGTAAGGCTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGCGTGGAGCCCA 

CAAGTTTCCCAGTCGAGTCTGAGGCTGTGATGGAGGATGTGCTGAGACCTTTGGAACAGGCATTGGAAGA 

CTGCCGTGGCCACACAAGGAAGCAGGTATGTGATGACATCAGCCGACGCCTGGCACTGCTGCAGGAACAG 

TGGGCTGGAGGAAAGTTGTCAATACCTGTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGCC 

ACCGGTGGGACGGAGCAGATGACATCCACCXjCTCCCTC^ 

GATGGTAGGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAAGAG 

AAATCTGCAGCCACAGCTGAGAAGAACCATACCATACCAGGCTC 

CCCAGACT 

Human SRA1 protein sequence - varl (public gi: 9930610) (SEQ ID NO: 291) 

OTRCPAGQAEVEMAELYVKPGNKERGWN 

GPPPMGPPPPSSKAPRSPPVGSGPASGVEPTSFPVESEAVMEDVLRPIiEQALEDCRGHTRKQVCDDISRR 

LALLQEQWAGGKIjSIPVKKRMALLVQELSSHRWDAADDIHRSL 

EEAANEEKSAATAEKNHTI PGFQQAS 

Human SRAl protein sequence - var2 (public gi: 25123255) (SEQ ID NO: 292) 

MGPPPPSSKAPRSPPVGSGPASGVEPTSFPVESEAVMEDVLRPLEQAIiED^ 

QEQWAGGKLS IPVKKRMALLVQELS SHRWDAADD I HRSLMVDHVTEVSQWMVGVKRL IAEKRSLFS EEAA 
NEEKSAATAEKNHTI PGFQQAS 
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Human SRA1 protein sequence - var3 (public gi: 9930614) (SEQ ID NO: 293) 

MTRCPAGQAEVEMAELYVKP^ 

GPPPMGPPPPSSKAPRSPPVGSGPASGVEPTSFFVESEARIJlEDVLRPIiEQAU 

RIiAIjLQEQWAGGKLSIPVKKRMALIiVQELSSHRWDAA^ 

SEEAANEEKS AATAEKNHTI PGFQQAS 

Human SRA1 protein sequence - var4 (public gi: 9930612) (SEQ ID NO: 294) 

MTRCPAGQAEVEMAELiYVKPGNKER^ 

GPPPMGPPPPSSKAPRSPPVGSGPASGVEPTSFPVESEAVMEDVLRPLEQ^ 

I^AL^EQWAGK3KIjSIPVKKRMALIjVQ^ 

EEAANEEKSAATAEKNHTI PGFQQAS 



Unigene Name : SYNE 1 Uni gene ID : Hs. 416719 Clone ID : 3GDJL38aa2938 
Human SYNE1 mRNA sequence - varl (public gi: 21753084) (SEQ ID NO: 183) 

GTACAAAAACGAACTTTCAGAAAATGGATCAACTCT 

ACGATCTTTTTGAAGACATGAAAGATGGTGTTAAACTGCTTGCCCTTCTGGAGGTCCTC 

ACTGCCTTGTGAACAAGGACGCCGGATGAAGCGAATCCATGCTG^ 

TTCCTCGAAGGAAGAAAGATTAAATTAGTCAACATTAACTCCACCGATATAGCT 

TAGCTCTTGGATTGATGTGGACCATTATTCTATATTTCGAGATTGAAGA 

GCTCCAGTCTTTGTCCAGCAGCGCATCCTCCGTGGAGAGCATAGTTAGCTCTGAG^ 

AGTAAACGGAAGGTGACCACCAAGATCCAAGGAAATGCTAAGAAGGCCT^ 

CAGCTGGCAAGCAGACTGGAATAGAAGTAAAAGATTTTGGGAAGAGTTGGAGAAGCGGGGTTGCCT 
TTCAGTTATTCATGCCATTCGACCGGAATTGGTGGACTTGGAGACAGTGAAAGGCAGATCCAACCGAGAA 
AATTTGGAGGATGCTTTCACTATCGCTGAAACAGAACTGGGGATCCCAAGACTGCTAGATCCTGAAGACG 
TTGATGTGGATAAACC^GATGAGAAATCTATTATGACCTAT^ 

CATCCACAATGCAAGCACTGATGGGCAAGAGGATGATGAAATACTTCCAGGTTTCCCAT 
TCTGTACAAAATTTTAAGAGAGAAGACAGAGTAATTTTT^ 

AGAGAGATTTGACAAGAGCACAGATGGTGGAATCAAATTTACAGGATAAATATCAGTCATTTAA 
CAGAGTTCAATATGAAATGAAGAGGAAACAGATTGAACATTT^ 

TTGTC^CTTGACCAAGCATTGGTAAAACAATCTTGGGATAGAGTGACCTCCAGGCTCTTTGACT 
TACAGCTTGATAAATCTCTTCCTGCACCTCTC^ 
CCTGAGAGAGGAAATAACCGTTGAAGAGGTCCACGAGGAAA(^ 
CAACATAAGGATCTGCTTCAAAACACGGATGCCCACAAAAGAGC^ 

CTGTTAACGGGATTCCAGTGCCACCTGATCAATTAGAGGACATGGCCGAGAGGTTTCATTTTGTTTCCTC 

CACATCAGAGCTACACCTAATGAAAATGGAATTTTTAGAATTAAAGTACCGTCTGCTOT 

CTTG CAGAGTCAAAG CTGAAGTCTTGGATCATTAAGTACGGGAGGAGAGAGT CAGTGGAGCAGCTTCTAC 

AAAACTACGTGTCTTTTATAGAAAATAGCAAGTTCTT^ 

ACAGACAGCTGAGATGTATGTCAAAGCAGATGGTTCAGTGGAAGAAGCTGAGAATGTGATGAAATTCATG 
AATGAAACCACCGCTCAGTGGAGGAATCTCTCAGTAGAAGTGAGGAGTGTGAGGAGCATGCTGGAAGAAG 
TGATCTCTAACTGGGATCGCTATGGC^TAC^GTGGC 

AATGCTCAATCAATCAGAAAATGCGAAAAAGGATTTTTTTCGAAATTTACCTCATTGGATTCA 

ACTGCCATGAACGATGCTGGCAATTTTCTAATTGAAACCTGTGATGAGATGGTTTCCCGTGACCTGAAGC 

AGCAATTACTGTTGCTAAATGGGCGGTGGAGGGAGTTGTTTATGGAAGTCAAGCAATATGCTCAAGCTGA 

TGAGATGGACAGAATGAAGAAGGAATACAGAGACTGTGTTGTTACCCTGTCTGCTTTTGCAACGGAAGCC 

CATAAGAAACITTCTGAACCCTTAGAAGTCTCTTTTATGAATGTCAAGCTATTAATTCAAGACTTGGAGG 

ATATTGAGCAGAGGGTGCCTGTGATGGATGCCCAATACAAGATAATTACAAAGACAGCACACCTC^ 

CAAAGAAAGCCCCC 

Human SYNE1 mRNA sequence - var2 (public gi: 22332201) (SEQ ID NO: 184) 

AGCGGCTGCCTCCTTGTTGAGTGCTGCAAAGGCCTGGAATTCATTTATGACAGAATAGATCTAGAAAAGT 

CCAAGCATGTTTTCTAGAGTGGTGTAGCCCTGTGCTGCCTCCAGTGAAGAGTCTCTTGGTGTTGGCTTCG 

TGCTTCCGGAGGGACCATGGCAACCTCCAGAGGGGCCTCCCGGTGTCCTCGGGATATCGCCAATGTGATG 

CAGAGGCTGCAAGATGAGCAAGAGATAGTACAAAAACGAACTTTCACAAAATGGATCAACTCT 

CCAAGCGGAAACCTCCAATGGTGGTGGACGATCTTTTTGAAGAGATGAAAGATGGTGTTAAACTGCTTGC 

CCTTCTGGAGGTCCTGTCTGGGCAGAAACTGCCTTGTGAACAAGGACGCCGGATGAAGCGAATCCATGCT 

GTGGCT AACAT TGG CACGG CAC TC AAGTT C CT CGAAG GAAGAAAG ATTAAAT T AGT CAACATT AACTC CA 

CCGATATAGCTGATGGCCGACCCTCAATAGTTCTTGGATTGATGTGGACCATTATTCTATATTTCCAGAT 
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TGAAGAGTTGATZCAGCAACCTGCCCCAGCTCC^TCTTTGTC 

gttagctctgagactcccagccaagcaagtaaa^ 

aggctttattaaagtgggttcagtacacagctggcaag^ 

gagttggagaagcx3gggttgcctttcattcagttattcatg 

acagtgaaaggcagatccaaccgagaaaatttggaggatgctttcactatcgccgaaacagaact 
tcccaagactgctagatcctgaagatottgatgtggataaaccagatgagaaatct 
agcccagtttctgaaagattatcctgacatccacaatg 
cttccaggtttcccatcttttgcaaatt 

aaatgaaagtttggatagaacaatttgagagagatttgacaagagcac^ 

ggataaatatcagtcatttaagcacttcagagttc^ 

atacaaccattacacagagacggtaaattgtcact^ 

tgacctccaggctctttgactgg^tatac^^ 

tgc ctggctgtacagagcggaggtggc cctc^gagaggaaataac 

gcaaacacgatacaacggaaacttgagc^^cataaggatc 

CMTCCATGAAATCTACCX3GACCAGGTCTGTTAAC 
GGCCGAGAGGTTTCATTTTGTTTCCTCCACATCAGAG 

AAGTACCGTCTGCTATCACTGCTGTTCTTGCAGAGTCAAAGCTGAAGTCTTGGATCATTAAGTAC 
GAGAGAGTCAGTGGAGCAGCTTCTACAAAACTACGTGTCTC 
TATGAGGTGACATACCAGATCTTGAAACAGACAGCTGAGATC 
AAGCTGAGAATGTGA^GAAATTCATGAATGAAACCACCGCT 

GAGTGTGAGGAGCATGCTGGAAGAAGTGATCTCTAACTGGGATCGCTATGGCAATACAGTGGCTAGTCTG 

CAAGCCTGGCTAGAGGATGCTGAAAAAATGCTCAATCAAT 

ATTTACOTCATTGGATTCAGCAGCATACTGCCATGAACGATGCTGGCA 

TGAGATGGTTTCCCGTGACCTGAAGCAGCAATTACTGTTGCTAAATGGGCGGTGGAGGGAGTTGTTTATG 

GAAGTCAAGCAATATGCTCAAGCTGATGAGATGGACAGAATGAAGAAGGAATACACAGACTG 

CCCTGTCTGCTTTTGCAACGGAAGCCCATAAGAAACTTTCTGAACCCTTAGAAGTCTCTTTTATGA^ 

CAAGCTATTAATTCAAGACTTGGAGGTGAGGGGTTTCTGAAT 

AGGAAAGATCAGCAAGTTTATTCAGATCATTGCAAAAGCT 

ATGTCTGTATGTCCCAATTTGCACCTGTCAGAAAAAAATG 

ATATAAAGTAACTTCAAATTGTTAAAAAAAAAAAAAAGAAAAAAAAAAAAA 

Human SYNE1 mRNA sequence - var3 (public gi: 28192627) (SEQ ID NO: 185) 

AGTACGCGGGAGTCTTAAAACGGAAGAAGAAAAAGAAGCAGTTG 

TGAGTGCTGCAAAGGCGTGGAATTCATTTATGACAGAATAGATCTAGAAAAGTCGAAGCATGTTTTCT 
AGTGGTGTAGCCCTGTGCTGCCTCC^TGAAGAGTCTCT 

TGGCAACCTCCAGAGGGGCCTCCCGGTGTCCTCGGGATATCGCCAATGTGATGCAGAGGCTGCAAGATGA 
GCAAGAGATAGTACAAAAACGAACTTTCACAAAATGGATCAACTCTC^ 

ATGGTGGTGGACGATCTTTTTGAAGACATGAAAGATGGTGTTAAACTGCTTGCCCTTCTGGAGGTCCTGT 

CTGGGCAGAAACTGCCTTGTGAACAAGGACGCCGGATGAAGCGAATCCATGCTGTGGCTAACATTGGCAC 

GGCACTCAAGTTCCTCGAAGGAAGAAAGATTAAATTAGTCAACATTAACTCCACCGATATAGCTGACGG 

CGACCCTCAATAGTTCTTGGATTGATGTGGACCATTATTCTATATTTCCAGATTGAAGAGTTGACCAGCA 

ACCTGCCCCAGCTCCAGTCTTTGTCCAGCAGCGCATCCTCCGTGGAGAGC^ 

CAGCCCACCAAGTAAACGGAAGGTGACCACCAAGATCCAAGGAAATGCTAAGAAGGCTTTATTAAAGTGG 

GTTCAGTACAQ^GCTGGCAAGC^GACTGGAATAGAAGTAAAAGATTTTGGGAAGAGTTGGAGAAGCGGGG 

TTGCCTTTCATTCAGTTATTCATGCCATTCGACCGGAATTGGTGGACTTGGAGACAGTGAAAGG 

CAACCGAGAAAATTTGGAGGATGCTTTCACTATCGCCGAAACAGAACTGGGGATCCCAAGACTGCrAGAT 

CCTGAAGACGTTGATGTGGATAAACCAGATGAGAAATCTATTATGACCTATGTAGCCCAGTTTCTGAAAC 

ATTATCCTGAGATCGAGAATGCAAGCACTGATC 

TTTTGCAAATTCTGTACAAAATTTTAAGAGAGAAGACAGAGTAATTTTTAAGGAAATGAAAGTTTGGATA 
GAACAATTTGAGAGAGATTTGACAAGAGCACAGATGGTGGAATCAAATTTACAGGATAAATATCAGTCAT 
TTAAGCACTTCAGAGTTGAATATGAAATGAAGAGGA 

AGACGGTAAATTGTCACTTGACCAAGCATTGGTAAAACAATCTTGGGATAGAGTGACCTCCAGGCTCTTT 

GACTGGCATATACAGCTTGATAAATCTCTTCCTGCACCTCTGGGCACCATAGGTGCCT 

CGGAGGTGG CCCTGAGAGAGGAAATAACCGTTCAACAGGT CCACGAGGAAACAGCAAACACGATACAACG 

GAAACTTGAGCAACATAAG 

Human SYNE1 raRNA sequence - var4 (public gi: 21734187) (SEQ ED NO: 186) 

GGGACACAGTGAGAAACCACTGTATGAAGTGTTTGGGGCTCAGATGGCAGTGTGGTGGCAGGACACAC^ 

GCAGGGGCAACAAAGCCGGAGTCCTGGGAGAGACTTCGGGAAAGAAACCATGGAGCAGAGTCCAGA 

GAATCGCAGTTGGTCAGGTGGGCTCGGAAGTGCTTAGAGGAAGAAAGGAGGGGAAGCCTCCCTGCTTTGT 

GATTGGCCTGGTGCTTTTGATCATTGGCAGTTTGTCTGGAATGACAGGCATGGAATGGACAAAGTGGAGA 

AGAGCCTGGCTGTGAAGCAGCCTGTTCACCATGGAAAAGAGCTGTGGCTCCTATCCTGAAGGTCGTGGAG 
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ccacagcaggatctgcggagggaggtgctgggatcctccct^ 

atctttctggataccacagggagaagggcatattcggcg^^ 

cagacagaagtagggtggtggcctaaactaggggaagc^ 

gagaaatcacatagagaagactcctccagaaotctcagt^ 

gctgggcatcttggtgaaagatgcaggtggtc^ 

ggtgcaggtgggcaagggaggtggggctgtgagagcagggagg^ 

tcttgagatacctgcagaatatccaaatgcaaaagtccagtc^ 

CAGAAATGCAGATTGGGTAGGTATTCACATGTAAATGGCAATGGTCCTGGAGTGAA 

CAGGAAGAGTGTGTGAAGGAAAACAAGAAGGACCACCACCCAAGC(^ 

GAGAAACAGAAACTCTGTAAGGAAGGTGAATAAAAATAGAATAAAGAGTTGGAGGCTGA 

TGGAAATGTATCTCATACATTCTGTCAAAGGACAT 

CAGATTCCCAAGGGATGACACTGTTCTAAAAAGAAAATGATTTCT 

AAGGCCTATTAGTCAGGCATATGGCATCTGAAGCAGAGCT 

TGTTGAGCTCTGAGATGGGACTGTGCAAATTGAGATGGGTTGTGCGTGG^ 

CAAAGACTTAGTACAAGAAAGAAAATAAAATATTAATAA 

AATGTAGTGTTAAAATTAATTTTATAAGTTTCTTTTTAGATTTCTAATGTGGCT 

TACATATATAGTTCACATAGAAATATATGGGACAGCGCTGCTCTGGAGTCTGGGCTGAAATCTCAGTTCT 

GCCATGTACTTTCTGTTTAAACTTAGATAAGGAACCTAATTC 

AATGGGAATAAGATTCCCAGGTACCCTATAGGGTTTCT 

AGTCTAGTTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGCAACCTCCGCCTCCCCG 
GTTCAAGCGATCCTCCTGCCTCAGCCTCCGATGTAGCTGGGATTACAGGTACCTG 

AATTTTTGTATTTTTAGTAGAGATGGGGTTCCACCATGTCGGCCAGGCTGGTCTCGAATTCCTGACCTCA 

GGTGATCCACCCGCCraGGCCTCCCAAAGTGCTGGG 

TGAATTCTTATGATTGATTTCAGGAAAGCTTTGGTGATC 

TTCCAGAACTTAGCTGTCAACTTAGACTGTGCTTTTTTTGCAGATGTAATGATCCCCGA 

CCTATGTAAAACTCACAGAAAATGC^TCAAAAATACCTCCGGTAGGCACAACAA 

CCTCCCTCCCCCGTAGAGCAATCCTCCTCTCTTAGCACATOTGCTGTCTCTCTCCCAGC^ 

AGCATGTTAAGGCACAGCCTTCCTCTCTTACTGCTGTACTAGAAAAAACAGCTGGTTAAATCCACACCGA 

GAATAAGATTTGACTAATCGAGCGAAATAAAATAACTTCTCA^ 

GGTATAGACCTCTGATGTCCATTAACTGCAGAAAATA 

CCTAGTGTTCTACGTGGTGTGGTAATTTGAGCTTCACTGCATGCAGACCTACCTGTGGCTGGAGACT 
GGTGGAGGCTCTGGTCCCAAGTCCGCCAGCCTC 

CTCATCTCAAGAATCCCGTATGAGACAAGGGGTGAGATCAGATTTCAGCTCTAAAAAATATATGTAATTT 
TAATTTAAGAGGTCGTAAAAGATAATTTGAAATGAAAAATGTATTTCACGGTATGCTGAGCCATAGATAA 
GAACAGAACTATTCCTGAAACAAGAAG/y^TTAAAGAAAGAAAAATGGAACTAGTTTTGCTTAGTGTTGTG 
TTAGACAAACTGTAGTGCAGGAGTTCAGGATCAGTGATGTGGGATGTCGGCGGGGAATAGAGTTTGAACG 
CAGTGATATGATATTGAATCACGGAGTTACTAGTTTACGCTTCAGTTTTTGAAGAAAATCAAAAGGACAG 
AAAGCAAAGTAACATTACTGAGAGGGTGATTCCCAGGGAGGGACCTCTCCTAGGTGTATCTAGAAGGCCT 
TTTTTTAGA7^CAAATAAAAACATTTAATAAAGCTTACTAATATTTGTTCTGCTTTCACCCCATGCTAGC 
TTCACTGATGATCAAATGTTCCTGTGTAGTTCGAAGACT^ 

TCAGTAATTTTAGAAAGAAATGTTACAACTTTGAGAGGAGAATGAGCCAGAATTCTAGGCTATGAGTAAG 
AACCTGCCTAGATGGAAATGTTAAAATCTTAGCTTTCTCCTGGTTTTGTTTCAGATCTTAGATAAAAAGC 
AAGTCGTTGCTAGTTTGATATCTCTGTATATATCTATTCTGAGGCACTCTTTTCTTGATTAATGAATTTA 
TGCCTTCAGTAAATGATGCAGCAACCTGAGCCTTCCGTGACACTATCTTCCCCTGAGGTGCATGAAGAAA 
AATCAGAGGGAGGATCOTCCCCTGCTCACTAAGCGATAGCAGAAAGAACATGAGAAAAAGAACAGCTTTC 
TCCTTACTGAGATGCAGTAGAGACGATTGCAGATTTTTAGAA 

GATCAGGGGTTCAGOTGGAAGAGGGTCAGGTGCACGAAGCTGTTCAAAACGGACTGGAGAGCCGTTTTGC 
GACGTCCGATCTGCTAGGGCTCTCAGTCAAGCACCTTCATTGGTTTGGCCATTTTAATGTCTAACCCACC 
GGCACGATGGTACTAT^TTTGCTTTGTAATTACCCACACCTGCCATTTCTATGCTGCTGTAACTGAAATC 
ATTT CTGAAACTTCTCCTTGAATATCAAGCTTTAAATAAGTCAAATAGTTTGTC CAGTAAAGATTCTTAT 
GGTTGCCACCGCAGGGGACCACAGTGCCCTAGAGTC^CAGATCCGAGAACTGGGCAAAGCCCTGGATGAT 
AGCCGCTTTCAGATACAGCA/^CCGAAAATATCATTCGCAGCAAAACTCCCACGGGGCCGGAGCTAGACA 
CC^GCTAC^AAGGCTACATGAAACTGOTGGGCGAATGCAGTAGCAGTATAGACTCCGTGAAGAGACTGGA 
GCACAAACTGAAGGAGGAAGAGGAGAGCCTTCCTGGCTTTGTTAACCTGCATAGTACCGAAACCCAAACG 
G CTGGTGTGATTGACCGATGGGAGCTTCTCCAGGCC CAGGCATTGAGCAAGGAGTTGAGGATGAAGCAGA 
ACCTCCAGAAGTGGCAGC^GTTTAACTCAGACTTGAACAGCATCTGGGCCTGGCTGGGGGACACGGAGGA 
GGAGTTGGAACAGCTCCAGCGTCTGGAACTCAGCACTGACATCCAGACCATCGAGCTCCAGATCAAAAAG 
CTCAAGGAGCTCCAGAAAGCTGTGGACCACCGCAAAGCCATCATCCTCTCCATCAATCTCTGCAGCCCTG 
AGTTCACCCGGGCTGACAGCAAGGAGAGCCGGGACCTGCAGGATCGCTTGTCGCAGATGAATGGGCGrCTG 
GGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCGGGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGGT 
TTCCATGAAATGAGCCATGGTTTGCTTCTTATGCTGGAGAACATTGACAGAAGGAAAAATGAAATTGTCC 
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-CTATTGATTCTAACCTTGATC 

CTCTGGCAGCCTCCAGTTATTTTTAGCAGGGTTGCATTO 
GTTGTTTCTTTTTTTTTAACTTTTTGCATTATAGTOT 
CAATCTTTGTAAAGCAAACATGTGC^TAGAAGATGATATCTG 
TTGCTCAGTTGTAAAAATCTTAGTGTTCTCAAGCAATCTTAATTAGCTT^ 

TAATTTAAACTTCATGTTACATCT^ 
ATATTAGAATAGAATGAGACAATTAAGTCTCTTC^ 
TCACTCTGTTGCCCAGGGTGGAGTGCAGTGGTGCGATCT 
GAGCAATTCTCCTGCCAATTCTCCTGCCTCACCCT 

GCCTCGCTAATGTTTGTATTTTTAGTAGAGAC^GGGTTTCACCATGTTGGCCA 

TGGCCGCAGGTGATCCACCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAG 

GCCGAGTCTTTCAAAGAGGAATTAAATACATCAGATTA^ 

CCAAGACAAAATGGGAAAGAAGGAGTAAACTTACTTTC^^ 

CAAGGAAGGAGAAACTTTTTCTAGCACTAAATTCAAGAGGAAATT^ 

CTTTGGACAATATAATGTACAGTATATTTAAGTGACTTT 

TTAGAAGAAAATATTATGCACTTTGTAGCTCTCTGTA 

CACCTAAACTACAGAATTGGTACCTTTTAATTCAGTACCATAATAGTCTTAGAAACCTAGAGGAAATAGC 
TGTGGAACTGCATTTTTACTTCACTTTGACCTCTGGCATCAAGCTGTGAATGACGAATCACCCCTTTTTT 
TTTTCAAATCTTGACTAGATATCA.GAGGATACCTAGACATACTTCTGCTTCGCT 
TTTTCTGTTTAAAAGATTATCTTACATCTCACTTGC 

C^TTAACTAATTTGAACCTTCCAATAATACTGTGGACCAGGCATCAAATCAAACTGAGATCAGAGACGGT 
CAGGGGTCTTATAGAATATTTTTGGCAGAGGCAGGATTAGAACTCAGGCCGTGAGCTGCTGC^TCCTTTA 
GTGTGTGAGCTCCACGTTTGATGCTCAGGTATAATTTCCCACCAAGTTAAGTTGATTGCATTOT 
TTGGAGCTTTTGCCAATTATCAAAAATGCTTAGAAAAATTAATT^ 

CATGAGCTGTTGGAATCCCAACTCAGAGTAGCCTCTTTGCAAGACATGTCTTGCCAACTACTGGTGAATG 
CTGAAGGAACAGACTGTTTAGAAGCCAAAGAAAAAGTC(^TGTTATTGGAAATCGGCTCAAACTTCTCTT 
GAAGGAGGTCAGTCGTCATATCAAGGAACTGGAGAAGTTATTAGACGTGTCAAGTAGTGAGCAGGATTTG 
TCTTCCTGGTCTTCTGCTGATGAACTGGACACCTCAGGGTCTGTGAGTCCCACATCAGGAAGGAGCACCC 
CAAACAGAC^GAAAACGCCACGAGGCAAGTGTAGTCTCTCACAGCCCGGACCCTCTGT(^GCAGTCCACA 
TAGCAGGTCCACAAAAGGTGGCTCCGATTCCTCCCTTTCTGAGCCAGGGCCAGGTCGGTCCGGCCGCGGC 
TTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCAGCTTCTCCTGCTCCTCCTCATCGGGCTTGCCT 
GCCTTGTACCGATGTCAGAGGAAGACTACAGCTGTGCCCTCTCC^ 

CATGCTCAGATACACGAATGGCCCTCCTCCACTCTGAACTAAGCAGATGCCATCTGCAGAAGTGCTGGTA 

GCATAAGGAGGATCGGGTCATAAGCAATCCCAAACTACCAACAAGAGGACOTTGATCTTGGCGAAAGCCC 

TCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGCAAATTATGGCTTCAGAGGTGGAAGATAA 

ACAGTGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGAAATCTGGTTTGAGACTCTGAACCTTAG 

GACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCGCGGACTCATGAATTCTGGGCCCTTGGCCCATTCT 

GTGCACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCT 

ATGACCCTCCCAAGCACCATGTCAGTGTCGTACAATCTACG^ 

CCTTGTAACATACAATTTTTAAGAGCTTATATGGCAGCTTCCTTTTTACCTTGTTTTCCTTTGGGGCATG 
ATGTTTTAACCTTTGCTTTAGAAGCACAAGCTGTAAATCTAAAAGGCACTTTTTTTTAGAGGTATAAAGA 
AAAACTAGACGTAATAAATAAGATCATGGAAGGCTTTATGTGAAAAAAGTTGAATGTTATAGTAAAAAAA 
AAAGATATTTATGTATGTACAGTTTGCTAAAGCCAAGTTTTGTTTGTATTGATTTCTTTGCATTTATTAT 
AGATATTATAAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SYNE1 raRNA sequence - var5 (public gi: 21734305) (SEQ ID NO: 187) 

CACTGGCACGACCGCTCTGCAGACAGCCTGCTTTCTCCACAGCCTTCCTCCAATCTCTCCCTCTCGCTCG 

CTCAGCCCCTCCGGAGCGAGCGGTGAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCTGGAGTG 

GGATCACGACTATGACCTCAGTCGGGACCTGGAGTCTGCAATGTCCAGAGCTCTGCCCTCTGAGGATGAA 

GAAGGTCAGGATGACAAAGATTTCTACCTCCGGGGAGCTGTTGCCTTATCAGATGTAATGATCCCCGAGA 

GCCCTGAGGCCTATGTAAAACTCACAGAAAATGCAATCAAAAATACCTCCGGGGACCACAGTGCCCTA^ 

GTCACAGATCCGACAACTGGGCAAAGCCCTGGATGATAGCCGCTTTCAGATACAGCAAACCGAAAATATC 

ATTCGCAGCAAAACTCCCACGGGGCCGGAGCTAGACACCAGCTACAAAGGCTACATGAAA 

AATGC^GTAGCAGTATAGACTCCGTGAAGAGACTGGAGCACAAACTGAAGGAGGAAGAGGAGAGCCTTCC 

TGGCTTTGTTAACCTGCATAGTACCGAAACCCAAACGGCTGGTGTGATTGACCGATGGGAGCTTCTCCAG 

GCCCAGGCATTGAGGAAGGAGTTGAGGATGAAGGAGAACCTCCAGAAGTGGC^ 

TGAACAGC^TCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAAC^GCTCCAGCGTCTGGAACTCAG 

CACTGACATCCAGACCATCGAGCTCC^GATC^AT^AAGCTCAAGGAGCTCCAGAAAGCTGTGGACC^CCGC 

AAAGCCATCATCCTCTCCATCAATCTCTGCAGCC^ 

ACCTGGAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACOT 

GGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGGTTTCCAT^ 

CTGGAGAAC^TTGAC^GAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGCAGAGATACTTC 
AGGACCATCAC AAACAGCTTATGCAAATAAAG CATGAGCTGTTGGAATCCCAACT CAGAGTAGCCTCTTT 
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GC^GACATGTCTTGCCAACTACTGGTGAATGCTGAAGGAA 

CATGTTATTGGAAATCGGCTCAAACTTCTCTTGAAG 

TATTAGACGTGTCAAGTAGTC^CAGGATTTGTCTTC 

GTCTGTGAGTCCCACATCAGGAAGGAGCACCCCAAAC^^ 

TCACAGCCTGGACCCTCTGTCAGCAGTCCACATAGCAGGTCCACAAAAGGTC 

CTGAGCCAGGGCCAGGTCGGTCCGGCCGC^GCTTCCTGTTC^GAGTCCT 

GCTTCTCCTGCTCCTCCTCATCGGGOT 

CTCTCCAACAACTTTGCCCGGTCATTCCA 

CTAAGCAGATGCCATCTGCAGAAGTGCTGGTAGCATAAGGAGGATCGGGT 
CAACAAGAG£ACCTTGATCTTGGCGAAAGCCATCGGTC 

GTGCAAATTATGGCTTCAGAGGTGGAAGATAAACAGTGACGGGGGAACAAACAG 

GAAGAAATCTGGTTTGAGACTCTGAACCTTAGCACTAAGGA^ 

GACTGATGAATTCTGGGCCCTTGGCCCATTCT 

TTTCCCATGGTGCTGCTCCAACCATCAGATJ^ 

ACCAACCAACGAGTGCTGAAGAGATTTTAGi^ 

TTCCTTTTTACCTTGTTTTCCTTTGGGGCATGATGTTT 

CTAAAAGGCACTTTTTTTTAGAGGTATAAAGAAAAACTAGATGTAATAAAT 

TGTGAAAAAAGTTGAATGTTATAGTAAAAAAAAAAAGATATTT^ 

TTTTGTTTGTATTGATTTCTTTG<^TTTATTA^ 

Human SYNE1 niRNA sequence - var6 (public gi: 21750070) (SEQ ID NO: 188) 

TCAGAGGGTGCTCAATGCTTTCCTGAAAGCTTG 

CAGGGGCTGC^GGAAGCTGTTCGAAAGCTCC^CAAACAATGGAAGGATCTTCAAGGAGAAG 
ATTTGCTTCATCTGAAGATTGATGTGGAGAAGAATAGGOT 

GCTGGATCGAGAGACCAAGCTGATGCCCCAGGAAGGCAGTGAAAAGATAATTAAAGAGCACAGGGTTTTC 

TTCAGTGACAAAGGTCCTC^TCATCTCTGTGAGAAAAGGTTACAGCTCATCGAGGAACTCTGTGTGAAAC 

TCCCAGTGCGGGACCC^GTAAGGGACACACCTGGAACCTGTCACGTGACTCTCAAAGAGCTCAGAGCTGC 

CATTGACAGCACCTACAGGAAGCTCATGGAAGACCCAGACAAGTGGAAGGACTAGACT 

GAGTTCTCATCTTGGATATCTACAAATGAGACACAATTAAAGGGGATCAAGGGTGAGGCCATCGATACTG 

CCAACCACGGAGAGGTTA^CGTGCCGTTGAAGAGAT(^GAAATGGTGTTACCAAAAGGGGTGAGACCCT 

CAGCTGGCTGAAATCCAGGCTGAAAGTTTTGACAGAAGTT^ 

GATGAGCTGGCAAAATTATCCAGCTCTTTCAAGGCTCTTGTGACGCTGCTGTCAGAGGTTGAAAAGATGC 
TAAGC^TTTTGGGGACTGTGTCCAGTACAAAGAAATAGTCAAAAATTCTCTCGAAGAATTAATTTCTGG 
CTCTAAAGAAGTCCAGGAACAAGCTGAGAAGATCTTGGATACTGAAAATCTGTTTGA 
CTTCTTCATCACCAGCAAAAGACAAAGCGGATCTCAGCAAAGAAGAGAGATGTGC^ 

AGGCGCAGCAGGGAGAAGGGGGGCTGC CTGACCGAGGCCACGAGGAG CTG CGGAAGCTGGA'GAGCACACT 
GGATGGCCTGGAGCGCAGCCGGGAGAGGCAGGAACGCCGCATCCAGGTCACATTAAGAAAATGGGAGCGA 
TTTGAAA.CAAACAAAGAAACAGTAGTAAGATACCTTTTTCAAACAGGTT 
GTTTTAGCAGTTTGGAAAGTTTATCTTC^GAAC^ 

TATTGC^GTCCAGGCTGAGAACCTTGTAAAGGAAGCTTCAGAGATACCGCTTGGGCCCCAAAATAAGCAG 
CTGCTTCAACAGCAGGCCAAGTC^^TCAAAGAAC^ 

ATGTGATTGAC^GTCCTAAACTTTCTTCTCTGAGATAAAGTTTCATACAATCTTTCCTGTACCTTGTAT 

TCAAAACACTCTTAAAATCTCAAAGTGTCTGTGTATTTCAGCATGTTTTGAGGAAAC^ 

AAAGAAAGTATCGCTAATACAGAAACCAATATCTATAACAGAGCCCAAAAAATATAAAGGATGTGGGTTT 

TG(^TCTTAAACTGATCATGTTCATGAGAAAGCCATATCTATTCTATTCTGTGGCCTTTGTACATTGTAG 

AGGGAATCTTGAAAAAGAACTAATATTTAAAATAATTTTTTTACTATATTATTCTGCTGTCACCATTTAG 

AGCGAAAAGGAGATATTTTGTTAGTGTAGATTCCAGGCCTAAATAC^ 

AACCTGAAGAAGCTCCTGGAGCTTGTTTACAGTGCCTCGGTATTCAAGTTATCCTGACTAATATGCTCTT 
TCCAGAAATTAACTTTAAAATATTTTATTTTTAAACTTTTAATGTTTGTTTATCTG 

Human SYNE 1 niRNA sequence - var7 (public gi: 28192521) (seq id no: 189) 

CATATACAGCTTGATAAATCTCTTCCTGCACCTCT 

TGGCCCTGAGAGAGGAAATAACCGTTCAACAGGTCCACGAGGAAACAGCAAACACGATACAACGGAAACT 
TGAGCAACATAAGAGAAAATGC CGGAQAA.TGATGGATCTGCTTCAAAACACGGATGC CCACAAAAGAGCA 
TTCCATGATW^TCTACCGGACCAGGTCTGTTAACGGGATTCCAGTGCCACCTGATCAATTAGAGGACATGG 
CCGAGAGGTTTCATTTTGTTTCCCCCACATCAGAGCT^ 

GTACCGTCTGCTCTCACTGCTGGTTCTTGC71GAGTCAAAGCTGAAGTCTTGGATCATTAAG 
AGAGAGTCAGTGGAGCAGCTTCTAGAAAACTACGTGTCTTT^ 

ATGAGGTGACATACCAGATCTTGAAACAGACAGCTGAGATGTATGTCAAAGCAGATGGTTCAGTGGAAGA 
AGCTGAGAATGTGATGAAATTC^TGAATGAAACCACC^ 

AGTGTGAGGAGCATGCTGGAAGAAGTGATCTCTAACTGGGATCGCTATGGCAATACAGTGGCTAGTCTGC 
AAGCCTGGCTAGAGGATGCTGAAAAAATGCTCAATCAATCAG 



Figure 36 part - 102 

147/202 



WO 2004/078130 



PCT7US2004/006308 



TTTACCTGATTGGATTCAGCAGCATACT 

GAGATGGTTTC CCGTGAC CTGAAGCAGCAATTACTGTTGCT AAATGGGCGGTGGAGGGAGTTGTTTATGG 

AAGTCAAGCAATATGCTCAAGCTGATGAGATGGACAGAATGAAGAAGGAATACAC^ 

CCTOTCTGCTTTTGCAACGGAAGCCCATAAGAAAC^T 

AAGCTATTAATTCAAGACTTGGAGGATATTGAGCAGAGGGTGCCTGTGATGGATGCCGA^ 

TTACAAAGAGAGCACACCTCATTACCAAAGAAAGC^ 

GTCAAAGCTCAAAGAGCAGCTAACCAAGGTCAAAG 

CTGTTGATTCCGTTGGAGGAATTAGAAAAGCAGATG 

AAATTATCACAGTTCTTGAGCGTGAGGCACAATCGAGTC 

Human SYNB1 mRN A sequence - var8 (public gi: 19584384) (seq id no: 190) 

AAGCTATTAATTCAAGACTTGGAGGATATTGAGCAGAGG^ 

TTACAAAGACAGCACACCTCATTACCAAAGAAAGC CCCCAAGAAGAAGG CGACCAT 
GTCAAAGCTCAAAGAGCAGCTAACCAAGGTCAAAGAATGTT 

CTGTTGATTCCGTTGGAGGAATTAGAAAAGCAGATGACGTCCTTTTATGACTCACTTGGGAAAATCA 

AAATTATCACAGTTCTTGAGCGTGAGGCACAATCGAGTGCCCTTTTTAAACA 

AGCTTGTCAAGAAAACTGTAAGAAAACCTTGACACrTATTGAGAAAGGCAGT 

GTGACCTTGAGCAACGTGTTAAAGCATTTTGATCAGACGAGGCTACAAAGA 

TTGCTTTTCAGAGTATGGTAAAGAAAACTGGAGATT 

GAAG^GTTTGAGGAGTCTCGAGCAGAGTTGGAGAAGGT^ 

AAGGGGGATCCAGAGGAGCTCCTGCGGAGACACACTGAGTTT^ 

ATGCTTTCCTGAAAGCTTGTGATGAACTCACCGACAT.CCTTCCAGAGCAGGAGCAGC^ 

AGCTGTTCGAAAGCTCCACAAACAATGGAAGGATOTTCAAGGAGAAGCCCCTTATCATTTGCTTCATCTG 

AAGATTGATGTGGAGAAGAATAGGTTCTTAGCCTOTGTAGAAGAATGCAGAACTGAGCTGGATCGAGAGA 

CCAAGCTGATGCCCCAGGAAGGCAGTGAAAAGATAAT^ 

TCCTC^TCATCTCTGTGAGAAAAGGTTACAGCTCATC^ 

CCAGTAAGGGACACACCTGGAACCTGTCACGTGACTCTCAAAGAGCTCA 

ACAGGAAGCTCATGGAAGACCCAGACAAGTGGAAGGACTACACTAGCAGATTCTCTGAGTTCT 
GATATCTACAAATGAGACACAATTAAAGGGGATCAAGGGTG 

GTTAAACGTGCCGTTGAAGAGATC^GAAATGGTGTTACCAT^AAGGGGTGAGACeCTCAGCTGGCTGAAAT 
CCAGGCTGAAAGTTTTGACAGAAGTTTCTTCTGAGAATGAAGCC^ 
ATTATCCAGCTCTTTCAAGGCTCTTGTGACGCT^ 
GACTGTGTCCAGTACA^GAAATAGTCAAAAATTCT 

AGGAAG^GCTGAGAAGATCTTGGATACTGAAAATCTGTTTGAAGCACAGCAGTTACTTCTTCAT 
GGAAAAGACAAAGCGGATCTCAGCAAAGAAGAGAGATGTGCAG 

GAAGGGGGGCTGCCTGACCGAGGCCACGAGGAGCTGCGGAAGCTGGAGAGCACACTGGATGGCCTGGAGC 

GCAGCCGGGAGAGGCAGGAACGCCGCATCGkGGTCA<^ 

AGAAAGAGTAGTAAGATACCTTTTTCAAACAGGTTC 

GAAAGTTTATCTTCAGAACTGGAACAAACAAAGGAGTTTTCTAAAC^ 

CTGAGAACCTTGTAAAGGAAGCTTCAGAGATACCGCTTGGGC^ 

GGCCAAGTCAATCAAAGAACAAGTCAAAAAATTAGAAGACACGCTT^ 

ATGGTGAAAACCAAGTGGGATCATTTTGGCAGTAATTTTGAGACTCTGTCCGTCTGGATAACTGAGAAAG 

AAAAAGAACTGAATGCCTTGGAAACTTCGTCA^ 

AATTCAGGAAATAGAAAGTAAGCTGkGCAGCATTGTAGGATT^ 

TTTGTTACCACTGGAGAATCTGCTCGAATTAAAGCCAAGTTGACACAAATAAGAAGATACGGGGAAGAGC 
TTCGAGAGCATGCACAGTGTCTGGAAGGAACAATCCTC 

GAACCTTAGAAAGATCCAGCAATCTGTGTCTGAATTTGAAGATA^CTTGCTGTTCCAATTAAAATATGT 

TCTTC^GCTACAGAAACATACAAAGTTCTTCAAGAAC^^ 

GC^GCGCGATCACTGCCTTCTCAGCCAGTC 

TGCGGCTCTACAGCAGCAATACGAGGACATCCTAAGGAGGGCGAAGGAGAGACAGACGGCGCTGGAGAAT 
CTGCTGGCCCACTGGCAGAGGCTAGAGAAAGAACTATCATCCTTTTTGACCTGGTTAGAGCGGGGTGAAG 
CTAAAGCCAGTTCCCCAGAAATGGACATTTCTGCAGA 

ACAGGCAAGTTCAAGGAAGTGTGAGGAAGGAAAAAATAAAATGCTTTTTGTTACAGTTAC^ 
ATAATAAAATAAA.TAAAACTTGTAAAAAAAAAAAAAAAA7VAAAAAAAA . 

Human SYNE1 raRNA sequence - var9 (public gi: 17861377) (seq id no: 191) 

AAGGTAAAGCCACTAGAGAGAAACTGAAAGAAAACATTC^ 
AAAATATGATTTATAGACCACAGATAGGAATTAAGAGTT^ 

AGGATTATCAAGAGGAAATTGCTATTGCTCAAGAGAACAAAATACAGCTCC^CAAATGGGAGAACGACT 
TGCTAAAGCCAGCCATGAAAGCAAAGCATCTGAGATTGAATACAAGCTGGGAAAGGTCAACGACCGGTGG 
CAGCATCTCCTGGACCTCATTGCAGCCAGGGTGAA 

TTGATAAGAACATGAGCAGCCTGAGGACCTGGCTCGCTCACATCGAGTCAGAGCTGGCCAAGCCAATAGT 
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CTACGATTCCTGTAACTCGGAAGAAATAC^ 

GAGAAGCACAGTACAGGTGTTGCATCTGTCOTCAACCTGTGTGAAGTCCT 
GTGCCACTGATGCCGAGTGTGACTCTATACAGCAGGCT 

TTGTGCTATGTCCATGGAAAGGAGGCTGAAAATCGAAGAGACGTGGCGATTC 

GACTATTCACGTTTTGAAGATTGGCTGAAGTCT^ 

TGATCTATACAGTTGCCAAGGAAGAACTAAAGAAATTTGAGGCTTT 

GACGCAGCTGGAACTGATCAACAAGCAGTACCGCCGCCTGGCCAGGGAGAACTO 

AGCCTCAAACAGATGGTTCACGAAGGCAACCAGAGATGGGA 

TGCGCAGACTCAAGCATTTTATTGGCCAGCGTGAG 

GCTCACAGAGATGGATCTGCAGCTCACTAATATTGAACATT^ 

AAGCAAOTCAAGGCCrrrCCAGCAGGAAATTTCACTGAACC^ 

GAGAACAGCTGATAGAAAAGAGTGAGCCCTTGGATGCAGCGATCATCGAGGAGGAACT 
ACGGTACTGCCAGGAGGTCTTCGGGCGTGTGGAAAGATACCATAAGAAACTGATCCGCCT 
GACGATGAGCACGACCTCTCAGACAGGGAGCTGGAGCTGGAAGACTCT 
GGCACGACCGCTCTGCAGACAGCCTGCTTTCTCCACAGCCTTCCTCCAATCTCT 

GCCCCTCCGGAGCGAGCGGTCAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCTGGAGTGGGAT 
GACGACTATGACCTCAGTCGGGACCTGGAGTCTGCAATC 
GTCAGGATGACAAAGATTTCTACCTCCGGGGAGCTGTTGCCTTA 
TGAGGCCTATGTAAAACTCACAGAAAATGCAATCAAAA^^ 

CAGAT CCGACAA.CTGGGCAAAG CCCTGGATGATAGCCGCTTTCAGAT ACAGCAAACCGAAAA 

G CAGCAAAACT C C CACGGGGCCGGAG CTAGACACCAG CT ACAAAGG CT ACATGAAAC TG CTGGG CGAATG 

CAGTAGCAGTATAGACTCCGTGAAGAGACTGGAGCACAA 

TTTGTTAACCTGCATAGTACCGAAACCGAA^ 

AGG CATTGAG CAAGGAGTTGAGGATGAAG CAGAACCT CCAGAAGTGGC AG CAGTTTAACTCAGACTTGAA 
CAGCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAACAGCTCCAGCGTCTGGAACTCAGCACT 
GACATCCAGACCATCGAGCTCCAGATCAAAAAGCTCAAGGAGCTCCAGAAAGCTGTGGACCACCGCAAAG 
CCATC^TCCTCTCCATCAATCTCTGCAGCCCTGAGTTCACCCAGGCTGACAGCAAGGAGAGCOTGGACCT 
GCAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCGGGGC 
CTGCTGCAGGATGCCCTGATGCAGTGCCAGGGTTTCCATGAAATGAGCCATGGTTTGCTTCTTATGCTGG 
AGAACATTGAGA.GAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGCAGAGATACTTCAGGA 
CCATCAGAAACAGCTTATGCAAATAAAGCATGAGCTGTTGGAATCCCAACTCA 

GACATGTCTTGCCAACTACTGGTGAATGCTGAAGGAACAGACTGTTTAGAAGCCAAAGAAAAAGTCCATG 
TTATTGGAAATCGGCTCAAACTT CT CTTGAAGGAGGTCAGTCGT (^TATCAAGGAACTGGAGAAGTTATT 
AGACGTGTCAAGTAGTCAGCAGGATTTGTCTTCCTGGTCTT^ 
GTGAGTCCCACATGAGGAAGGAGCACCCCAAACAGACAGA^ 
AGCCTGGACCCTCTGTCAGCAGTCGACATAGCAGGTCC^ 

GCCAGGGCCAGGTCGGTCCGGCCGCGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCAGCTT 
CTCCTGCTCCTCCTCATCGGGCTTGCCTGCC^TC 
CCAACAACTTTGCCCGGTCATTCGACCCC^TGCTCAGATA 
GCAGATGCCATCTGCAGAAGTGCTGGTAGCATAAGGAGG 

AAGAGGACCTTGATCTTGGCGAAAGCCCTCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGC 
AAATTATGGCTTCAGAGGTGGAAGATAAACAGTGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAG 
.AAATCTGGTTTGAGAC^CTGAACCTTAGCACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACT 
CATGAATTCTGGGCCCTTGGCCCATTCTGTGCACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCTTTC 
CCATGGTGCTGCTCCAACCATCAGATT^AATGACCCT 
ACCAACCAGTGCTGAAGAGATTTTAGAACCTTGTAACATACA^ 

TTTTTACCTTGTTTTCCTTTGGGGCATGATGTTTTAACCTTTGCTTTAGAAGCACAAGCTGTAAATCTAA 
AAGGCACTTTTTTTTAGAGGTATAAAGAAAAACTAGATGTAATAAATAAGATCATGGAAGGCTTTATGTG 
AAAAAAGTTGAATGTTATAGT 

HmnanSYNEl mRNA sequence - varlO (public gi: i786i385) (seq id no: 192) 

CAAAAATCAGTCTGATCTCGGGAAACCTGGAGAAATTTATTTTCTGTACTCTAATGTTCTTTC^TTTTGG 
TGACCATCAAGGTGCTGGGAGAGGAATTAGATGGCTGTAATT 
ACAGAAATTCTTGGAAC^^AATGGCCAACTGGGTAAGCGA^ 
CTTGAGCAGCAGACCATTAGACAAGCTGAGAATCGGCTCTC 

AAGAATACAATGAAATGCTTGT^TTAATTTTGAAGTGGATTGAAAAAGCTAAAGTCTTGGCTCATGGAAC 
TATTGCATGGAATTCTGCAAGCCAGCTTCGGAAAC^UVTATATTTTGCATCAGACCCTGCTAGAAGAATCC 
AAAGAAATTGACAGTGAGCTGGAAGG^TGACTGAGAAATTAC^^ 
AAAAAATGTCTGkGCAAGTGGCAGAACTGGGACGGGAGACTGAGGAG^^ 

TTTG CAGAACCTCGAA.GATGCAGCTAAGGATATGAAAAAATTTG AAG CAGAGTTGAAAAAGTTACAAGCT 
GCCTTGGAGGAAGCCCAGGCAACACTGACTTCTCGAG 

CTCATCGGCAGCATTTGTTGTCTGAGATGGAGTC^CTGAAGCCGAAGGTGCAAG(^GTG(^GCTCTGCCA 
GAGTGCCCTCCGGATCCCCGAGGATGTGGTTGCCAGCTTACCTCTCTGTCATGCTGCTCTGCGGCTGCAG 
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GAAGAGGCCAGCCGGCTGCAGCACA^ 

ATGAACAATATGAGCAAGAAATGAAACATCTCCAGCAACTGAT^^ 
TAAACCTGTTGCC^CCAGT/VACATACAGGAGCTGCAGGCTCAGATT^ 
AAAATTAAGGGCTACCAGGAGCAGATCGCTTC^ 
AGCACGCGACCATGCTGCTGACGGTGACCGAGGTCGAGGG 

GGAGCTCCTCCCCACGCCTTCGGCCCACCCCTCTGTGGTCATGATGACTGCAGGTCGCT^ 

CTGTCACCGGTCACTGAGGAGTCTGGGGAGGAGGGAACCA^ 

GCTCCCCTTCACCTGTGGCTAATACAGATGCTTCTGCT 

TGCTGAGAGGTTGCAGACAGATGCTGCAAAAATTCACCCCAGCA 

CCGGGATTGGAGCCATCCGCTACTGCCAAACTGGGTGATTTGCAGCGTTCTTG^ 

TGATCAGTGAGAAGCAGCGCACACTCTATGAAGCTTTGGAGCGCCAGGAGAAGTACC^ 

GTCCATCTCTACGAAGATGGAGGCGATTGAGCTGAAACT 

<5AAAGCCAGATGGeTCAACATCAGGCATTGATGGATGAGATTCT 

TCCAGTCCTCTCTCGCAGAGGAGCTGGTATCCGAGTCITG 

GCAGTCCACGCTCACTGTCTTAGCCGAGCGAATGTCCA^ 

CTTTTGGAGGAGAAGTTGAATGATCAGCTGGAGGAACi^ 

GTGAAGCCGATGAGCTGGAC^GCTGGCTCTTGAGTACCAAGGCCA.CTCTGGACACTGCGCTGAGTCGAC^ 

CAAGGAGCCCATGGA(^TGGAGGCCCAGCTTATGGACT^ 

GTGGTGGCTTTATCAGAACTGTCAGTCCACAATGAGAACC^GC^ 

ACGAGGCCGAGCAGCTGGCTGGAAAGCTGAGAAGGCTCAAGGGGA 

GCATGATAAGCAGCTCAACATGCAGGGAACAGCACAGGAGAAGGAGGAGA 

ACGCAGAGCCCCGGCGTCCAGGAATGGCTGGCCCAAGCTCGCACCACATGGACCCAG(^GCGGCAGAGCA 
GTCTCCAGCAACAAAAAGAGTTAGAACAGGAATTAGCCGAG 

TCGTGGAGAGGAGATTCTAATTCAACATTCGGCGGCAGAGACCTCTGGTGATGCTGGCGAAAAACCTGAT 

GTGTTATCCCAGGAGTTGGGGATGGAAGGGGAGA7UVTCATCCGCTGAAGACCAGATGAGAATGAAATGGG 

AAAGCCTACATCAAGAATTTAGTACCAAGCAGAAACTACTACAGAATGTTCTGGAACAGG 

AGTGCTTTATAGGAGGCCAAATCGACTCTTGTCTGGTGTGCCACTGTACAAAGGGGACGTGCCAACCCAA 

GATAAATCTGCAGTTACATCTTTGCTGGATGGACTGAACCAAGCCTTCGAGGAGGTTTC^ 

GAGGGGCAAAGAGGCAGAGTATACACTTGGAGCAGAAGTTGT^ 

GTTGGATGACGTTGAAGAACGTTTATTTGTTGCCACAGCA.CTTTTACCAGAAGA 

TTCAACG^GAGATTCTTGCCAAAGACATTAAGGAAATGTCTGAAGAAATGGATA^ 

TTTCCCAAGCTTTTCCAGAGAATGGTGATAATCGAGA 

CAGGTTATCCITGCTAGACTC^GTAGTGAA^ 

CTAAATTTCI^GAATGATCTGAAAGTGCTGTTTAC^^ 

AACTGG CAAATGTGTTTGAACAGCCCGTAG CAGAACAAATAGAGGCAATACAACAGG CTGAAGATGGACT 
CAAAGAATTTGATGCAGGAATCATTGAATTAAAGAGGCGTGGTGACGAGCTACAGGTCGAGCAGCCGTCC 
ATGGAAGAACTCTCCAAGCTCCAGGACATGTATGATGAGCTGATGATGATCATTG 

GTCTGAATCAGAACCTTACACTCAAGAGTCAGTATGAGAGGGCCCTACAAGATCTGGCTGACCTGCTAGA 
AACTGGTCAGGAGAAGATGGCAGGAGACCAGAAAATCATCGTGTCTTCCAAAGAGGAAATCCAGCAACCA 
CTTGACAAACATAAGGAATACTTTCAGGGCCTGGAATCTC^ 

AGATAAT<^GCTTTGCAGTCCAAAAGGAAACCCAGTTCC^TAa\GAGCTGATGGCTCAGGCTTCTGCTGT 
ACTGAAACGGGCTCACAAGAGGGGTGTGGAGCTGGAGTACATTCTAGAGACGTGGTCCCATCTGGATGAG 
GACC^GCAGGAGCTCAGCAGACAGCTGGAGGTGGTGGAAAGCAGCATCCCAAGCGTGGGTCTGGTGGAGG 
AGAACGAGGACAGGCTTATTGACCGCATAACACTCTACCAGCATTTAAAATCTAGCCTTAATGAATACCA 
GCCCAAATTATATCAAGTATTAGATGATGGGAAACGACTTCTGATATCGATCAGCTGCTCAGATCTAGAA 
AGCCAACTAAATCAACTTGGAGAGTGCTGGCTAAGTAACACCAATAAAATGTCTAAGGAACTTCACAGAC 
TGGAAACAATATTGAAACACTGGAC CAGATATCAAAGTGAATCTGCAGATCTAATT CACTGGTTACAATC 
TGCAAAAGACCGGOTAGAATTTTGGACTCAGCAATCTGTGAC^ 

GATCATCTAAATGCTTTCCTGGAGTTTTCTAAAGAAGTGGATGCCCAATCTTCCCTGAAATCATCTGTTC 
TGAGTACTGGAAATCAGCTCCTTCGACTAAAAAAGGTGGACACAGCCACGCTGCGCTCTGAGTTGTCGCG 
CATTGATAGCCAGTGGACTGACCTGCTAACCAATATCCCAGCCGTCCAGGAGAAGCTCCACGAGCTCCAG 
ATGGATAAACTGCCTTCCCGCCATGCCATTTCTGA 

TTCAGAAGGATGAAGATAATATTAAAAATTCCATAGGTTACAAGGCAATTCATGAATACCTTC^GAAATA 

TAAGGGTTTTAAGATAGACATTAACTGTAAACAGCTGACAGTGGATTTTGTGAACCAGTCCGTGCTACAA 

ATCAGGAGT(^GGATGTGGAAAGTAAGCGTAGTGATAAGACTGATTTTGCTGAGCAACTTGGAGCAATGA 

ATAAAAGTTGGCAAATTCTGCA7VGGTCTAGTAACTGAGAAGATCCAGCTGTTGGAAGGCTTATTGGAATC 

TTGGTCAGAATATGAAAATAATGTACAATGTCTGAAAACATGGTTTGAAACCCAGGAAAAGAGACTAAAA 

CAACAGCATCGAATTGGAGATCAGGCTTCTGTTCAAAATGCACTGAAAGACTGTCAGGATCTGGAAGATC 

TGATTAAAGCAAAAGATAAAGAAGTAGAGAAAATTGAGCAGAATGGACTTGCTT^ 

AGAAGACGTCTCTAGCATTGTCATGAGGACACTGCGAGAGCTCGGCC^ 

ATGGTTGGACAATTAAAGATACTGCTGAAATCAGTGCTTGACCAATGGAGTAGTCACAAAGTGGCCTTTG 
ACAAGATAAACAGTTACCTCATGGAGGCCZAGATACT!CTCTTTCCCGATTCCGTCTGCTGACTGGCTCCTT 
AGAAGCTGTGCAAGTTCAGGTGGACAATCTTCAGAATCTCCAAGATGATCTGGAAAAACAGGAAAGGAGC 
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TTACAGAAATTTGGCTCTATCACCAA^ 

CCAATACACTGAAAGAAGTCAAGATGAGATGGAATAACTTGC^ 

CAGCAAGGCCCTACTTCAGCTTTGGCAAAGATACAAGG 

CAGCAGGAGGATCGAACCAATGAGCTGTTGAAGGC^GCCA 

CCACATGGATTCAAGATTGCAACGACCTCCTCAAAGG^ 

CCATGAGCTGGGAGAGCAACTGAAGCAACAAGTGGATGCT^ 

CTCTCTTTGAGTCAACACTTGTGTGCCCTGGAGCAAGCTCTCTC 

GAGTTCTTGATTATGAAACCTTTGCCAAG 

AATACTACAAGGGCAGGACCCTAGCCACTCATCTGAC^ 

AAGGGACAGATGTTAAAATTCAGCAGCATGGCTCCAGATTO 

TACCCTTGAATGATAAGGAAATCAAAAGAATGCAGAATCTGAACOT 

GACTACAGAAAGATTCAGCAAGTTGCAGTCATTTTTGCTAC^ 

AC^TGGATGGAATTCCTAGTTCAGAC^GAACAAAAGTTAGCAGTAGAGAT^ 

TTTTGGAACAGCAGAGAGCACACGAGTTGTTTCAAGCCGAGATGTTCAGT 

AATCATTATTGATGGGCA^CGTC^CTAGA^ 

TTGACACTCCTCAGTAATCAATGGCAGGGAGTGATT^ 

GCCAGATTCGCCAGTGGCAGCGCTATAGGGAGATGGCAGAAAAGCT^ 

CTACCTCCCCATGAGTGGTCTCGGAAGTGTTCCTATACCACT 

GTGCAGTTCAAAGAAAAAGTGTTTCTGCGGCAACAAG 

AACTCCTTCTCTCGGCGGAC^GTGGCGCTGAGGCCGCCTTGCAGGCCGAACTCGCTGAAATCCAAGAG^ 

ATGGAAATCAGCCAGCATGCGGCTGGAAGAACAGAAGAAAAAACTAGCCTTCT^ 

AAATGTGAGAAAGGAATAGCAGATTCCCTGGAGAAACTACGAACTTTCAAAAAGAAGCT^ 

TCCCGGATCACCATGAAGAGCTCC^TGCAGAACAAATGCGTTGCAAGGAATTAGAAAATGCAGTTGGGAG 

CTGGACAGATGACTTGACCC^GTTGAGCCTGCTGAAGGACACCCTCTCTGCCTATATCAGTGCTGATGAT 

ATCTCCATTCTTAATGAACGCGTAGAGCTTCTGCAAAGGCAGTC 

TAAGGCGGCAGCAAATAGGTGAAAGATTGAATGAATGGGCAGTCTTCAGTGAAAAGAAC^GGAACTCTG 

TGAGTGGTTGACTCAAATGGAAAGCAAAGTTTCTCAGAATGGAGACATTCTCATTGAAG 

AAGCTCAAGAAGGATTATCAAGAGGAAATTGCTATTGCT 

GAGAACGACTTGCTAAAGCCAGC(^TGAAAGCAAAGCATCTGAGATTGAATACAAGCTGGGA 

CGACCGGTGGCAGCATCTCCTGGACCTCATTGCAGCCAGGGTGAAGAAGCTGAAGGAGACCCTGGTAGCC 

GTGCAGCAGCTTGATAAGAACATGAGCAGCCTGAGGACCTGGCTCGCTCACATCGAGTC^GAGCTGGC^ 

AGCCAATAGT CTACGATT CCTGTAACTCGGAAGAAATACAGAGAAAGCTTAATGAGCAGCAGGAGCTTCA 

GAGAGACATAGAGAAGCACAGTACAGGTGTTGCATCTGTCCTCAACCTGTGTGAAGTCCTGCTGCACGAC 

TGTGACGCCTGTGCCACTGATGCCGAGTGTGACTCTATACAGCAGGCTACGAGAAACCTGGACCGGCGGT 

GGAGAAACATTTGTGCTATGTCCATGGAAAGGAGGCTGAAAATCGAAGAGACGTGGCGATTGTGGCAGAA 

ATTTCTGGATGACTATTCACGTTTTGAAGATTGGCTGAAGTCT 

TCTTCTGGGGTGATCTATACAGTTGCCAAGGAAGAACTAAAGAAATTTGAGGCTTT 

ACGAGTGCCTGACGCAGCTGGAACTGATCAACAAGCAGTACCGC^ 

TTCAGCATGTAGCCTCAAACAGATGGTTCACGAAGGCAACC 

ACCTCCATeTTGCGCAGACTCAAGCATTTTATTGGCCAGCGTGAGGAGTTTGAGACTGCGCGGGACAGCA 

TTCTGGTCTGGCTCAC^GAGATGGATCTGCAGCTCACTAATATTGAACATTTTTCTC 

AGCTAAAATAAAGCAACTCAAGGCCTTCCAGCAGGAAATTTCACTGAACGA.CAATAAGATT 

ATTGCCCAAGGAGAACAGCTGATAGAAAAGAGTGAGCCCTTGGATGCAGCGATCATCGAGGAGGAACTAG 

ATGAGCTCCGACGGTACTGCCAGGAGGTCTTCGGGCGTGTGGAAAGATACCATAAGAAACTGATCCGCCT 

GCCTCTCCCAGACGATGAGCACGACCTCTCAGACAGGGAGCTGGAGCTGGAAGACTCTGCAG^ 

GACCTGCACTGGCACGACCGCTCTGCAGACAGCCTGCTTTCT 

CGCTCGCTCAGCCCCTCCGGAGCGAGCGGTCAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCT 

GGAGTGGGATCACGACTATGACCTCAGTCGGGACCTGGAGTCTGCAATGTCCAGAGCTCTGCCCTCT 

GATGAAGAAGGTCAGGATGACAAAGATTTCTACCTCCGGGGAGCT 

CCCTAGAGTCACAGATCCGACAACTGGGCAAAGCCCTGGATGATAGCCGCTTTCAGATACAGCAA^ 

AAATATCATTCGCAGCAAAACTCCCACGGGGCCGGAGCTAGACACCAGCTACAAAGGCTACAT 

CTGGGCGAATGCAGTAGCAGTATAGACTCCGTGAAGAGACTGGAGCACAAACTGAAGGAGGAAGAGGAGA 

GCCTTCCTGGCTTTGTTAACCTGCATAGTACCGAAACCCAAACGGCTGGTGTGATTGACCGATGGGAGCT 

TCTCCAGGCCCAGGCATTGAGCAAGGAGTTGAGGATGAAGCAGAACCTCCAGAAGTGGCAGCAGTTTAAC 

TCAGACTTGAACAGCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAACAGCTCCAGCGTCTGG 

AACTCAGCACTGACATCCAGACCATCGAGCTCCAGA 

CCACCGCAAAGCCATCATCCTCTCCATCyUVTCTCT 

AGCCGGGACCTGCAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGG 
AGTGGCGGGGCCTGCTGCAGGATGCCCTGATGCAGTGCCAGGGTTTCCATGAAATGAGCCATGGTTTGCT 
TC^ATGCTGGAGAACATTGA(^GAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGCAGAG 
ATACTTCAGGACCATCACAAAGAGCTTATGCA^^ 

CCTCTTTGCAAGACATGTCTTGCCAACTACTGGTGAATGCTGAAGGAACAGACTGTTTAGAAGCCAAAGA 
AAAAGTCCATGTTATTGGAAATCGGCTCAAACTTCTCTTGAAGGAGGTCAGTCGTCATATG^^ 
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GAGAAGTTATTAGACXjTGTCAAGTAGTCAGCAGGATTTGT 

CCTCAGGGTCTGTGAGTCCCACATCAGGAAGGAGCACCCC^^ 

TAGTCTCTCACAGCCTGGACCCTCTGTCAGCAGTCC^ 

CCCTTCAGCTTCTCCTGCTCCTCCTGATCGGGC^ 

CI^TGCCCTCTCCAACAACTTTGCCCGGTCATTCCACCCCATC 

CTCTGAACTAAGCAGATGCCA.TCTGCAGAAGTGCTGGTAG 

AAACTACCAACAAGAGGACCTTGATCTTGGCGAAAGCCCTO^ 

CACATGTGTGCAAATTATGGOTTCA.GAGGTGGAAGATAAACAGTGACGGG 

AGGTTTGGAAGAAATCTGGTTTGAGACTCTGAACCTTAGCACT 

TTCCCCGGACTCATGAATTCTGGGCCCTTGGCCC^ 

GGGCAGCTTTCCCATGGTGCTGCTCGAACC^TCA 

ACAATCTACCAACCAACCAGTGCTGAAGAGATTTTAGAACCTTGT 

TGGGAGCTTCCTTTTTACCTTGTTTTCCTT 

TGTAAATCTAAAAGGCACTTTTTTTTAGAGGTATAAAGAAAAACTAGATGTAATAAATAAGATCATGGAA 
GGCTTTATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNE1 niRNA sequence - varll (public gi: 17227153) (seq id no: 193) 

AACTCCTTCTCTCGGCX5GACAGTGGCGCTGAGGCCX3CCT 
ATGGAAATCAGCCAGC^TGCGGCTGGAAGAACAGAAGAAAAAA^^ 
AAATGTGAGAAAGGAATAGC^GATTCCCTGGAGAAACTACGAACTT^ 
TCCCGGATGA.CCATGAA.GAGCTCCATGCAGAA 

CTGGACAGATGACTTGACCCAGTTGAG CCTGCTGAAGGAC ACCCTCT CTGCCTATAT CAGTG CTGATGAT 
ATCTCCATTCTTAATGAA.CGCGTAGAGCTTCTGCAAAGGCAGTGGGAAGAACTATGCCACCAGCTCTCCT 
TAAGGCGGCAGCAAATAGGTGAAAGATTGAATGAATGGGCAGTCTT 
TGAGTGGTTGACTCAAATGGAAAGCAAAGTTTCTCAGAATC 

AAGCT CAAGAAGGATTAT CAAGAGGAAATTG CTATTGCT C AAG AG AACAAAATACAGCTCCAACAAATGG 

GAGAACGACTTGCTAAAGCCAGCCATGAAAGCAAAGCATCTGAGATTGAA 

CGACCGGTGGCAGCATCTCCTGGACCTCATTGCA^ 

GTGCAGCAGCTTGATAAGAACATGAGCAGCCTGAGGACCTGGCTCGCTCACATCGAGTCAGAGCTGGCCA 
AGCCAATAGTCTACGATTCCTGTAACTCGGAAGAAATACAGAGAAAGCTTAATGAGCAGCAGGAGCTTCA 
GAGAGACATAGAGAAGCACAGTACAGGTGTTG<^TCTGTC^ 

TGTGACGCCTGTGCCAOTGATGCCGAGTGTGACTCTATACAGCAGGCTACGAGAAACCTGGACCGGCGGT 
GGAGAAACATTTGTGCTATGTCCATGGAAAGGAGGCTGAAAATCGAAGAGACGTGGCGATTGTGGCAGAA 
ATTTCTGGATGACTATTCACGTTTTGAAGATTGGCTGAAGTCTTCAGAAAGGACAGCTGCTTTTCCCAGC 
TCTTCTGGGGTGATCTATAGAGTTGCCAAGGAAGAACT^ 

ACGAGTGCCTGACGCAGCTGGAACTGATCAACAAGCAGTACCGCCGCCTGGCCAGGGAGAACCGCACTGA 
TTCAGCATGTAGCCTCAAACAGATGGTTC^CGAAGGCAACCA^ 

ACCTCCATCTTGCGCAGACTCAAGCATTTTATTGGCCAGCGTGAGGAGTTTGAGACTGCGCGGGACAGCA 
TTCTGGTCTGGCTCACAGAGATGGATCTGCAGCTCACTAATATTGAACATTTTTCTGAGTGTGATGTTCA 
AGCTAAAATAAAGCAACTCAAGGCCTTCCAGCAGGAAATT^ 

ATTGCCCAAGGAGAACAGCTGATAGAAAAGAGTGAGCCCTTGGATGCAGCGATCATCGAGGAGGAACTAG 
ATGAGCTCCGACGGTACTGCCAGGAGGTCTTC^GGCGTGTGGAAAGATACCATAAGAAACTGATCCGCCT 
GCCTCTCCCAGACGATGAGCACGACCTCTCAGACAGGGAGCTGGAGCTGGAAGACTCTGCAGCTCTGTCG 
GACCTGCACTGGCACGACCGCTCTGCAGACAGCCTC 

CGCTCGCTCAGCCCCTCCGGAGCGAGCGGTCAGGACGAGACACCCCAGCTAGTGTGGACTCCATCCCCCT 
GGAGTGGGATCACGACTATGACCTCAGTCGGGACCTGGAGTCTGCAATGTCCAGAGCTCTGCCCTCTGAG 
GATGAAGAAGGTCAGGATGACAAAGATTTCTACCTCCGGGGAGCTGTTGCCTTATCAGGGGACCACAGTG 
CCCTAGAGTCACAGATCCGACAACTGGGCAAAGCCCTG^^ 

AAATATCATTCGCAGCAAAACTCCCACGGGGCCGGAGCTAGACACCAGCTACAAAGGCTACATGAAACTG 
CTGGGCGAATGCAGTAGCAGTATAGACTCCGTGAAGAGACTGGAGCACAAACTGAAGGAGGAAGAGGAGA 
GCCTTCCTGGCTTTGTTAACCTGCATAGTACCGAAACC 

TCTCCAGGCCCAGGCATTGAGC^GGAGTTGAGGATGAAGC^GAACCTCCAGAAGTGGCAGCAGTTTAAC 
TCAGAOTTGAACAGCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAAGAGCTCCAGCGT 
AACTCAGCACTGACATCCAGACCATCGAGCTCCAGATCAAAAAGCTCAAGGAGCTCCA 
CCACCGCAAAGCCATC^TCCTCTCCATCAATCTCT 

AGCCGGGACCTGCAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTCTCT 

AGTGGCGGGGCOTGCTGC^GGATGCCCTGATGCAGTGCCAGGGTTTCCATGAAATGAGCCATGGTTTGCT 

TCTTATGCTGGAGAACATTGACAGAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGCAGAG 

ATACTTCAGGACCATCACAAACAGCTTATGGAAATAAA 

CCTCTTTGCAAGACATGTCTTGCCAACTACTC 

AAAAGTCCATGTTATTGGAAATCGGCTCAAACTTCTOTTGAAGGAGGTCAGTCGTCATATCAA 
GAGAAGTTATTAGACGTGTCAAGTAGTCAGCAGGATTTGTCTTCCTGGTCTTCTGCTGATGAACTGGACA 
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CCTCAGGGTCTCTGAGTCCC^CA^ 
TAGTCTCTCACAGCCTGGACCCTCTGT 

TCCCTTTCTGAGCCAGGGCCAGGTCGGTCCXSGCCGCGGCTTCC^ 
CCCTTGAGCTTCTCCTGCTCCTCCTC^ 

CTGTGCCCTCTCCAA.CAACTTTGCCCXX3TCATTCCACCCCATGCT 
CTCTGAACTAAGC^GATGCCATCTGCAG^ 
AAACTACCAACAAGAGGACCTTGATCTTGGCGAAAGCCCT^ 
GACATGTGTGCAAATTATGGCTTGAGAG 

AGGTTTGGAAGAAATCTGGTTTGAGACTCK3AACCTTAGCACTA 

TTCCCCGGACTCATGAATTCTGGGCCCTTGGCCCATTCTGTG^ 

GGGCAGCTTTCCCATGGTGCTGCTCCAACCATCAGATA^ 

ACAATCTACGAACCAACCAGTGCTGAAGAGATTTTAGA 

TGGCAGCTTCCTTTTTACCTTGTTTTCCTTTGGGGCATGATGTTTTAACCTT 

TGTAAATCTAA^GGCACTTTTTTTTAGAGGTATAAAGAAAAACTAGATGTAATAAATA 

GGCTTTATGTGAAAAAAGTTGAATGTTATAGT 

Hxrraaa SYNE 1 niRNA sequence - varl2 (public gi: 16550165) (seq id no: 194) 

ACAAAAGAGCATTCCATGAAATCTACCGGACCAGGTCTGTTAACGGGATTCCAGTGCCACCTGATCA^ 

AGAGGACATGGCCGAGAGGTTTCATTTTGTTTCCTCCACATCAGAGCTACACCTAATO 

TTAGAATTAAAGTACCGTCTGCTCTCA.CTGCTGGTTCTTGCAGAGTCAA 

AGTACGGGAGGAGAGAGTCAGTGGAGCAGCTTCTACAAAACTACGTC 

CTTTGAA.CAATATGAGGTGACATACCAGATCTTGAAACAGACAGOT 

TCAGTGGAAGAAGCTGAGAATGTGATGAAATTCATGAATGAAACCACCGCTCAGTGGAGGAATCTCTCAG 
TAGAAGTGAGGAGTGTGAGGAGCATGCTGGAAGAAGTGATCTCTAACTGGGATCGCTATGGCAATACAGT 
GGCTAGTCTGCAAGCCTGGCTAGAGGATGCTGAAAAAA 
TTTTTTCGAAATTTACCTCATTGGATTCAGC^ 

AAACCTGTGATGAGATGGTTTCCCGTGACCTGAAGCAGCAATTACTGTTGCTAAATGGGCGGTGGAGGGA 

GTTGTTTATGGAAGTCAAGCAATATGCTCAAGCTGATGAGATGGACAGAATGAAGAAGQ 

TGTGTTGTTACCCTGTCTGCTTTTGCGACGGAAGCCCATAAGAAACTTTOT 

TTATGAATGTCAAGCTATTAATTCAAGACTTGGAGGATATTGAGCAGAGGGTGCCTGTGATGGATGCCCA 

ATACAAGATAATTACAAAGACAGCACACCTCZATTACCAAAGAAAGCCCCCAAGAAGAAGGAAAAGAAATG 

TTTGCGACCATGTCAAAGCTCAAAGAGCAGCTAACCAAGGTCAAAGAATGTTAC^ 

AGTCTCAGCAGCTGTTGATTCCGTTGGAGGAATTAGAAAAGC^GATGACGTCCTTTTATGACTCACTT 

GAAAATCAATGAAATTATCACAGTTCTTGAGCGTGAGGCACAATCGAGTGCCCTTTTTAAACAAAAACAT 

CAGGAACTGTTAGCTTGTCAAGAAAACTGTAAGAAAACCTTGACACTTATTGAGAAAGGCAGTCAAAGTG 

TTCAAAAGTTTGTGACCTTGAGCAACGTGTTAAAGCATT^ 

AGATATTCATGTTGCTTTTC^GAGTATGGTAAAG 

AGTCGCTTGATGAAGAAGTTTGAGGAGTCTCGAGCAGAGTTGGAGAAGGTACTGCGGATTGCTCAGGAGG 
GCCTGGAGGAAAAGGGGGATCCAGAGGAGCTCCTGCGGAGAC^C^CTGAGTTTTTCAGTCAGCTGGATCA 
GAGGGTGCTCAATGCTTTCCTGAAAGCTTGTGATGAACTCACCGACATCCTTCCAGAGCAGGAGCAGCAG 
GGGCTGCAGGAAGCTGTTCGAAAGCTCCAGAAACAATGGAAGGTGAGT 

GCATCCTCAATGAAGGGAGAAGCTTGAGCGTGTTAAGGTCCAAATGTAAGAGAAATTTAGAAATTCCTGG 

AAAGTGACTGTAACTATTTCGCTCATTTAAAAACT 

TG 

Human SYNE1 mRNA sequence - varl3 (public gi: 16553949) (seq id no: 195) 

ATAGTAGAATTATTCATTATAATATTTGGCTTTG 
TTTATTTTCTGTACTCTAATGTTCTTTCATT 

TGTAATTCAAAGTTAATGGAATTAGATGCAGCAGTACAGAAATTCTTGGAACAGAATGGCCAACTGGGTA 
AGCCACTGGCCAAGAAGATAGGAAAACTGACTGAACTTCACCAGCAGACCATTAGACAAGCTGAGAATCG 
GCTCTCCAAGCTCAATCAGGC^GCATCACATTTAGAAGAAT^ 

TGGATTGAAAAAGCTAAAGTCTTGGCTCATGGAACTATTGCATGGAATTCTGCAAGCGA 
AATATATTTTGCATCAGACCCTGCTAGAAGAATCC^ 

GAAATTAGAGTACCTCACTAGCGTGTACTGTACAGAAAAAATGTCTCAGCAAGTGGCA 

GAGACTGAGGAGTTGCGACAGATGATCAAAATTCGTTTGCAGAACCTCC^GATGCAGCTAA 

AAAAATTTGAAGCAGAGTTGAAAAAGTTACAAGCTGCCTTGGAG^^ 

AGAAGTTGGACGTCTCAGTCTCAAGGAGCAGCTCTCTCATCGGCAGCATTTGTTGTCTGAGATGGAGT^ 
CTGAAGCCGAAGGTGCAAGCAGTGGAGCTCTGCGAGAGTC 

GCTTACCTCTCTGTC^TGCTGCTCTGCGGCTGCAGGAAGAGGCCAGCCGGCTGCAGCACACCGCCATCCA 
GGAGTGTAACATGATGCAGGAAGCTGTGGTACAATATO 

CAACTGATAGAAGGAGCTCACAGAGAGATTGAGGATAAACCTGTTGCCACCAGTAACATACAGGAG 
AGGCTCAGATACACGAATGGCCCTCCTCCACTCTGAACTAAGCAGATGCCATCTGCAGAAGTGCTGGTAG 
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CATAAGGAGGATCGGGTCATAAGCAATCCCAAACTAC^^ 

CGGTGTGGCAGCTTTAGCCCTCCTCGAGAT 

CAGTGACGGGGGAACAAACAGACAACAAGAAGGTT^ 

ACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACTCATGAATTCT^ 

TGCAGAGCCAAGGACTTCAGTAGACCATCTGGGCA 

TGACCCTCCCAAGCACCATGTCAGTGTCGTACT^ 

CTTGTAACATACAATTTTTAAGAGCTTATATGGCAG 

TGTTTTAACCTTTGCTTTAGAAGCACAAGCTGTAAA 

AAACTAGATGTAATAAATAAGATCATGGAAGGCTTTATGTGAAAAAAGTTGAATGTTATAGTAAAA 

AAGATATTTATGTATGTACAGTTTGCTAAAGCCA 

GATATTATAAAAT 

Human SYNEt raRNA sequence - varl4 (public gi= i2698056) (seq id no: 196) 

ACAACGGAAACTTGAGCAACATAAGGATCTGCTTCAAAACACGGATGCCC^ 

ATCTACCGGACCAGGTCTGTTAACGGGATTCCAGTGCCACCTGATCAATTAGAGGACATGGCC^ 

TTCATTTTGTTTCCTCGACATCAGAGCT 

GCTCTCACTGCTGGTTCTTGCAGAGTCAAAGCTGAAGTCTTGGATCAT^ 

GTGGAGCAGCTTCTACAAAACTACGTGTCTTTTATAGAAAATAGCAAGCT 

CATACCAGATCTTGAAACAGAC^GCTGAGATGTATGTCA 

TGTGATGAAATTCATGAATGAAACCACCGCT(^GTGGAGGAATCTCTCAGTAGAAGTGAGGAGTC 
AGCATGCTGGAAGAAGTGATCTOTAACTGGGATCGCTATGGCAATACAGTGGCTAGTCTGCAAGC 
TAGAGGATGCTGAAAAAATGCTCAATCAATCAGAAAATGCCAAAAAGG 

TTGGATT(^GCAGCATACTGCCATGAACGATGCTGGCAATTTTCTAATTGAAACCTGTGATGAGATGGTT 

TCCCGTGACCTGAAGCAGCAATTACTGTTGCTAAATGGGCGGTGGAGGGAGTTGTTTATGGAAGTCAAGC 

AATATGCTCAAGCTGATGAGATGGACAGAATGAAGAAGGAATACACAGACTGTGTTGTTACCCTGTCTGC 

TTTTGCAACGGAAGCCCATAAGAAACTTTCTGAACCCTTAGAAGTCTCTTTTATGAATGTCAAGCTATTA 

ATTCAAGACTTGGAGGATATTGAGCAGAGGGTGCCTGTGATGGATGCCCAAT^ 

CAGCAGACCTCATTACCAAAGAAAGCCCCCAAGAAG 

CAAAGAGCAGCTAACO^AGGTCAAAGAATGTTACT^ 

CCGTTGGAGGAATTAGAAAAGCAGATGACGTCCTTTTATGACTCACTTGGGAAAATCAATGAAATTATCA 
CAGTTCTTGAGCGTGAGGCACAATCGAGTGCCCTTTTTA 

AGAAAACTGTAAGAAAACCTTGACACTTATTGAGAAAGGCAGTCAAAGTGTTCAAAAGTTTGTGACCTTG 
AGCAACGTGTTAAAGC^TTTTGATCAGACGAGGCTAC^AAGACAGATTGCAGATATTCATGTTGCTTTTC 
AGAGTATGGTAAAGAAAACTGGAGATTGGAAGAAGCATGTGGAAACCAACAGTCGCTTGATGAAGAAGTT 
TGAGGAGTCTCGAGCAGAGTTGGAGAAGGTACTGCGGATTGCTCAGGAGGGCCTGGAGGAAAAGGGGGAT 
CCAGAGGAGCTCCTGCGGAGACACACTGAGTTTTTC^^ 

TGAAAGCTTGTGATGAACTCACCGACATCCTTCCAGAGCAGGAGC^GC^GGGGCTGCAGGAAGCTGTTCG. 
AAAGCTCCACAAACAATGGAAGGATCTTGAAG^^ 

GTGGAGAAGAATAGGTTCTTAGCCTCTGTAGAAGAATGCAGAACTGAGCTGGATCGAGAGACCAAGCTGA 
TGCCCCAGGAAGGCAGTGAAAAGATAATTAAAGAGCACAGGGTTTTCTTCAGTGACAAAGGTCCTCATCA 
TCTCTGTGAGAAAAGGTTAC^GCTCATCGAGGAACTCTGTGTGAAACTCCCAGTGCGGGACCCAGTAAGG 
GACACACCTGGAACCTGTCACGTGACTCTCAAAGAGCTCAGAGCTGCCATTGACAGCACCTACAGGAAGC 
TCATGGAAGACCCAGACAAGTGGAAGGACTACACTAGCAGATTCTCTGAGTTCTCATCTTGGATATCTAC 
AAATGAGACACAATTAAAGGGGATCAAGGGTGAGGCCATCGATACTGCCAACCACGGAGAGGTTAAACGT 
GCCGTTGAAGAGATCAGAAATGGTGTTACCAAAAGGGGTGAGACCCTCAGCTGGCTGAAATCCAGGCTGA 
AAGTTTTGACAGAAGTTTCTTCTGAGAATGAAGCCCAAAAGCAGGGAGATGAGCTGGCAAAATTATCCAG 
CTCTTTCAA^CTCTTGTGACGCTGCTGTCAGAGGTTGAAAAGATGCTAAGCAATTTTGGGGACTGTGTC 
. CAGTACAAAGAAATAGTCAAAAATTCTCTCGAAGAATTAATTTCTGGCTCTAAAGAAGTCCAGGAACAAG 
CTGAGAAGATCTTGGATACTGAAAATCTGTTTGAAGCACAG(^GTTACTTCTTCATCACCAGCAAAAGAC 
AAAGCGGATCTCAGCAAAGAAG AGAGATGTG CAGCAGCAGATCGCG CAGGCGCAGCAGGGAGAAGGGGGG 
CTGCCTGACCGAGGCCACGAGGAGCTGCGGAAGCTGGAGAGCACACTGGATGGCCTGGAGCGCAGCCGGG 
AGAGGCAGGAACGCCGGATCCAGGTCACATTAAGAAAATC 

AGTAAGATACCTTTTTCAAACAGGTTCCAGTCATGAACGCTTCTTGAGTTTTAGCAGTTTGGAAAGTTTA 

TCTTCAGAACTGGAACAAACT^AAGGAGTTTTCTAAACGGACAGAAAGTATTGCAGTCCAGGCTGAGAACC 

TTGTAAAGGAAGCTTCAGAGATACCGCTTGGGCCCCAAAATAAGCAGCTGCTTCAACAG 

AATCAAAGAACAAGTCAAAAAATTAGAAGACACGCTTGAAGAAGAGTATGTGATTGACAAGTCCTAAAC^ 

TTCTTCTCTGAGATAAAGTTTCATACAA^ 

AAAGTGTCTGTGTATTTCAGCATGTTTTGAGGAAACAACTCACAGTTCAAAAGAAAGTA 

GAAACCAATATCTATAACAGAGCCCAAAAAATATAAAGGATGTGGGTTTTGCATCTTAAACTGATCATGT 

TCATGAGAAAGCCATATCTATTCTATTCTGTGGCCTTTGTACATTGTAGAGGGAATCTTGAAAAAGAACT 

AATATTTAAAATAATTTTTTTACTATATTA^ 

TAGTGTAGATTCCAGGCCTAAATACACATCAC^ 

CTTGTTTACAGTGCCTCGGTATTCAAGTTATCCTGACTAATATGCTCTTTCCAGAT^TTAACTTTAAAAT 
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ATTTTATTTTTAAACTTTTAATGTTTGTTTATCTG 

Human SYNE1 inKNA sequence - varlS {public gi: 289S592) (seq id no: 197) 

CAACCTGCATAGTAACGAAACCAAACGGCTGGTGTGATTGACCGATGG 

TGAGCAAGGAGTTGAGGATGAAGCAGAACCTCCAGAAGTGGC^^ 

CTGGGCCTGGCTGGGGGACACGGAGGAGGAGTTGGAACAGCT 

CAGACCATCGAGCTCCAGATCAAAAAGCTCAAGGAGCTC<^^ 

TCCTCTCCATCAATCTCTGCAGCCCTGAGTTCACCC^ 

TCGCTTGTCGCAGATGAATGGGCGCTGGGACCX3AGTGTC 

CAGGATGCCCTGATGCAGTGCCAGGGTTTCX1ATGAAATGAGCCA 

TTGAGAGAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTG^ 

CAAACAGCTTATGGAAATAAAGGATGAGCTGTTGGAATCCCAACTCAGA 

TCTTGCCAACTACTGGTGAATGCTGAAGGAAC^^ 

GAAATCGGCTCAAACTTCTCTTGAAGGAGGTCAGTCGT 

GTCAAGTAGTCAGCAGGATTTGTCTTCCTGGTCTTCT^ 

CCCACATCAGGAAGGAGCACCCCAAACAGACAGAAAACGC 

GACCCTCTGTCAGCAGTCCACATAGCAGGTCCACAAAAGGTGGCTCCGATTCCTCCCTTTCTGAGCCAGG 
GCCAGGTCGGTCCGGCCGCGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCAGCTTCTCCTG 
CTCCTCCTCATCGGGCTTGCCTGCCTTGTACC^ 

ACTTTGCCCGGTCATCCACCCCATGCTCAGATAC^CGAATGGCCCTCCTC(^CTCTGAACTAAGCAGATG 
CCATCTGGAGAAGTGCTGGTAGCATAAGGAGGATCG^^ 

CCTTGATCTTGGCGAAAGCC^TCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGCAAATTAT 

GGCTTCAGAGGGTGGAAGATAAACAGTGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGA^ 

GGTTTGAGACTCTGAACCrTAGCACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACTCATGAA 

TTCTGGGCCCTTGGCATTCGTGTGCACAGCCAAG 

GCTGCTCCAACCATCAGATAAATGACCCTCCCCAAGCACCATGTC^ 

GAGTGCTGAAGAGATTTTAGAACCTTGTAACATACA 

CCTTGTTTTCCTTTGGGGCATGATGTTTTAACCTTTTGCTTTAGAAGCACAAGCTGTAAATCTAAAAGGC 
" ACTTTTTTCTAGAGGTATAAGAAAAACTAGATGTAATAAATAAGATCATGGAAGGCTTTATGTGAAAAAA 
GTTGAATGTTATAGTAAAAAAAAAAA 

Human SYNE 1 raKNA sequence - varl6 (public gi: 6330956) (seq id no: 198) 

CTCGATTTGTGCCGTCAGTCTAACAACCTGTGCTTGCA^ 

ACCATGACTGTATGAATGTTGTTGAAGTGTTCCTAGAAAAATTTACTACAGAATGGGATAACTTGGCCAG 
ATCTGATGGAGAGAGTACAGCTGTCCACCTGGAAGCTTTGAAAAAGT^ 

AAGTATGCTATTGAAGATCTGAAAGATCAAAAGCAGAAAATGATAGAGCATCTGAATTTAGATGACAAGG 

AGTTAGTCAAAGAACAGACGAGTCATTTAGAGCAACGTTGGTTTCAGCTTGAGGACCTCATTA 

AATCCAAGTGTCAGTCACCAACTTGGAGGAGTTAAATGTGGTGCAGTCCAGATTTCAGGAGCTAATGGAG 

TGGGCAGAAGAGCAACAACCCAACATCGCCGAGGCCCTTAAGCAGAGCCCTCCTCC^ 

ACCTTCTCATGGATCACCTGGCCATCTGCAGTGAACTGGAGGCC^^GCAGATGCTCCTGAAATCGCTTAT 

AAAGGACGCAGAC^GGGTCATGGCAGATCTTGGTCTCAATGAGCGACAGGTCATCGAGAAGGCTCTCTCT 

GATGCACAAAGCCACGTGAATTGTCTCAGTGACTTAGTGGGCCAGCGAAGAAAGTACTTAAACAAAGCCT 

TGTCCGAGAAAACCCAGTTTCTCATGGCAGTGTTCC^GGCCACC^GCCyVAATT(^GCAACATGAGCG^ 

GATAATGTTCCX3TGAACACATCTGTCTGTTACCAGATGATGTGAGGAAACAAGTCAAAAG^ 

GCACAAGCCAGCCTCAAGACTTACC^AATGAAGTCACTGGACTTTGGGCCCAGGGTCGCGAACTAATGA 

AGGAAGTCACAGAGCAGGAAAAGAGTGAAGTGOTGGGGAAGCTTCAGGAATTGCAGAGTGTOT 

TGTTTTACA/^AAGTGCAGTCACCGGTTACAAGAACTAGAGAAGAATTTGGTTTCTAGGAAGCATTTTAAG 

GAAGATTTTGATAAAGCTTGCCACTGGCTAAAAGAAGCAGATAT 

TGAATGAGAGTACTGAGCITCATACACAACTGGCTAAATACC^AAACATTCTT 

TGAAAATCTTCTACTTACGCTGCAGAGAACTGK3GCAGACCATATTACCATCGCTGAATGAAGTCGATCAT 
TCCTACCTCA.GTGAAAAGCTAAATGCTTTGCCTCGACAATTTAATGTAATTGTTGCCTTGGCTAAAGACA 
AGTTCTATAAAGTCC^GGAAGCAATTCTTGCTGGGAAGGAATATGCTTCCTTGATTGAGTTGACAACCO\ 
GTCTCTCAGTGAACTTGAAGCCCAATTCTTGAGGATGAGCAAAGTTCCCACCGACCTGGCCGTTGAGGAG 
GCTCTTTCTCTGCAAGATGGTTGCAGAGCCATTCTGGACGAGGTGGCGGGCCTTGGGGAGGCGGTGGATG 
AACTGAACCAGAAAAAAGAAGGTTTTCGCAGCACAGGTCAGCCTTGGCAGCCAGACAAGATGCTGCACCT 
TGTCACCTTATATCACAGGCTGAAGCGACAAA<^GAACAGAGGGTTAGCTTATTAGAAGACACCACCAGT 
GCTTACCAAGAACACGAGAAGATGTGCCAACAGCTGGAGAGACAACTGAAGTCTGTAAAAGAGGAGCAGT 
COU^GTGAATGAGGAAACGCTGCCTGCAGAGG^ 

CCAGGACTC^GGGATTGTACTGAAACGAGTAACCATACATCTTGAAGATCTTGCCCCACACCTTGACCCC 
TTGGCTTATGAGAAAGCCAGGCATCAGATCCAGTCCTGGCAAGGGGAGTTAAAACTGTTGACTTCTGCCA 
TTGGTGAGACGGTGACAGAATGTGAGAGCCGAATGGTGCAGAGTATAGACTTCCAGACTGAGATGAGTCG 
CTCCCTGGACTGGCTGAGGAGAGTGAAGGCAGAGCTCAGTGGGCCGGTGTACCTAGACCTC^ACCTGCAG 
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gacatccaagaggaaatcag&aaaatcca 

atgcgctgagtcacaaggaaaaggagaagttcacaaaggccaaggagctgatt^ 

c^gcctcgctgagctctcagagctggatggagac^tccaggaagcctt 

actgaaatatatagccagtgtcaaaggtattatcaggtatttc^ 

cccaagaaatgttacagctggcaggcaatggcctagacgtg 

catggaatttttcagtacagaggatcagttccatagtaacctggagg^ 

ctggacccactcatcaagccaaccggcaaagaagacctagaacagaaagtggcn^ 

gccagaggatgagccgggactctggtgcccaagtggatctcttgcagagatgcacagot 

ttaccagaaagcaa.gggaa.gaggttattgaattgatgaatgatacagaa^gaaattgt ctgagtttt ct 

ttgttgaagacttcx3tctagtcatgaagcggaagaaaaattgtca 

ttaactctttccatgagaaaattgtggcccttgaggaaaaagcttcacaactggagaaaaccgg 

tgccaggaaagccaccctgagcaggtcaatgaccaccgtc^^ 

gcccaggaccaggagaagatcctggaagatgcagtggatgagtggarc 

aggccactgaaatgattgatcagctgcaagataagttacc 

agagctcttaactcttcttgaataccacgacac 

GG GATGCTGCGGCAGCAAACCCTGAGCATGCTCCAGGATGGAGCQBCCCC^ 

CGCTCATGCAGGAAATCACCGCCATGCAAGATCGGTGCCTGAACATGC^ 

AAAGTTGGTGAAGCAAGAGCTGAAGGACCGAGAAATGGTGGAGACTCAGAT 

GTTCAGGAAACGAAAGAATATTTAGGGAATCCAACAATAGAAATA 

TTCTCCTAACAGAAGCCACAAATCACCGACAGAACATTGAAAAA^ 

CTTAGGTCTTTATACCATATTACCnTCTGAA.CTCTCCCTTCAGTTGGCTGAAGTGGCGTTAGATCTAAAG 

ATCCGAGATCAGATCGAAGACAAAATAAAAGAAGTTGAGCAG 

GG CAAATTGA.GAAGTTAGCTAAAGACCTCA.CAACTATTCTAACTAA.GCTGAA 

AGTTC^GCTAAAACTGACCAAAAGGTGCTGGGAGAGGAATTAGATGGCTGTAATTCAAAGTTAATGGAA 
TTAGATGCAGCAGTACAGAAATTCTTGGAACAGAATGGCCAACTGGGTAAGCCACTGGCCAA.GAAGATAG 
GAAAACTGACTGAACTTCACCAGC71GACCATTAGACAAGCTGAGAATCGGCT 

AGCATCAC^TTTAGAAGAATACAATGAAATGCTTGAATTAATTTTGAAGTGGATTGAAAAAGCTAAAGTC 

TTGGCTCATGGAACTATTGCATGGAATTCTGCAAGCCAG 

CCTTAGGAAAAATAATCTTTAAAAAGTAACCAAGGGCAATTTGATTTAAOTG 

TTAGAGGGCTGTGATGTAAAATTTTTGGAGCTACCAGATAAAAAGAATGCTAAGGTACCCCTAAGTTGTT 

CAGTAGTTGGACAGAAAGGAGCTTCTCATGAAATTTCATGAAATTGAATAAATAAATATCCTTGATCTTC 

CCTAAA.CCTACCTTACACCAAGACCCAAACCAATCAGCCTTGTAGAACTCATTTTCT 

AATAATTATCTGCAGGGATCTGGTGGGAAATTCTTTCCTGTGAAGAGATGCAATGAAGTGTGGAAAGATT 

CTAGACTCGACACTCAGACTGGTGGGAAAACCAACCTCCGCCATGCAGGGCTGTC 

TGCTTTGCCTCTCZTGAATTCTGTCTTCTCATCATTTGTATGAAGACGTAAATAATATTCGTATTTCAGAC 

TTATGAGATGAAGTGGTTTAAGGTACAGA.CGTGCAAACGTCTGCCTGGCACATGCAGGTGC 

GATCTCCCGCCTCCTCCCTCAGCCCTCACCCAGGCCTGTCATCTGGCCTTCCACAGGAGGTCGGGCAGCC 

<^GAGC^GC<^TGAGTCCACATCACATGCTGGCTATGTTAGTTCATTTCCrCTGAAGTTACATGAGA^ 

AATGTTCCTTTTCTGTCAGTC^OTTCATCC^GGAAATTATTTCATCCTTTTGTAACTTAAGCTTAAATTA 

GACACAGATAGTTAATAGGCTAGTTATCATATAATAAAATATAGGGTGACTTTTATAGGAGTTACATGGG 

TATCGAGTATTCTAGATTTTTGTCTCTTATATTATTTATGTATCCTTGTGGCCTTTAAATGAATCCCTGT 

TTCCATTCTTGTTTACAGGGTTCTAGATCAAAGCCTCATTTTTCCATTTTTGGAATGCTTTAACAG 

TAATTTTTCCCTATATTC^CAGTCTTTCTTCTCTGATCAATCT 

TAAGCAACTCCy^TCTTTTGCTTTAAGATATGCTTAGATATGAACAGACAGGACTTAAGTTACCACTGAT 
TTGAAAACAATGAAAAAAAGCCAACATCCTTAGAAGTCTAGAAATG^ 

AAGAAAGACAAACTTAACTGTCACATTCATACTGTTTCTTTCAAGTTCATATTTAAGGAAGTGAGAGCTC 
TCAAACATTGCTGGTATCCTGGTAAAATCTCTTTGAAAAATA^^ 

AATGTTGCTACTCTGGGCCACGTGCGGTGGCTCACACCTGTAGTCCCAGCACTTTGGGAGGCCGAGGTGG 
GTGGATCACAGGGTCAGGAGTTCGGGACCGGCTTGGCCGGTATGGTGAAGCCCCATCTCTACTAGGAGTG 
CAAAAGTCAGCTGGGCGTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAATCTGAGGCGGGAGAATC 
GCTTGAACTCGGGAGGTGGAGGTTGCAGTGAGGCAAGATCATGCCACTGCACTCCAGCCTGGGTGACAGT 
GAGACTCCATCTC 

Human SYNE1 mRNA sequence - varl 7 (public gi: 2052166D (seq id no: 199) 

GTTGGATTTCTCTAATGGAAAATGTTATTCAGAAGGATGAAGATAATATTAAAAATTCCATAGGTTACAA 

GGCAATTCATGAATACCTTCAGAAA.TATAAGGGTTTTAAGATAGACATTAACTGTAAAGAGOT 

GATTTTGTGAACCAGTCCGTGCTACAAATCAGCAGTCAGGATGTGGAAAGTAAGCGTAGTGATAAGACTG 

ATTTTGCTGAGCAACTTGGAGC^TGAATAAAAGTTGGCAAATTCTGCAAGGTCTAGTAACTGAGAAGAT 

CGAGCTGTTGGAAGGCTTATTGGAATCTTGGTCAGAATATGAA 

TTTGAAACCCAGGAAAAGAGACTAAAACAAC^GCATCGAATTGGAGATGAGGCTTCTGTTC^AAATGCAC 
TGAAAGACTGTCAGGATCTGGAAGATTTGATTAAAGCAAAAGAAAAAGAAGTAGAGAAAATTGAGCAGAA 
TGGACTTGCTTTGATTCAGAACAAGAAAGAAGACGTCTCTAGCATTGTCATGAGCACACTGCGAGAGCTC 
GGCCAAACCTGGGCAAATTTAGATCACATGGTTGGACAATTAAAGATACTGCTGAAATCAGTGCTTGACC 
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AATGGAGTAGTCACAAAGTGGCCTTTGACAAGATAAACAGTTACCT 

CCGATTCCGTCTGCTGACTGGCTCCTTAGAAGCTGTC 

GATGATCTGGAAAAACAGGAAAGGAGCTTACAGAAATTTGGCTCT 

CTCACCCACCCGTGACAGAAACTCTTACC^TAC^CTGAAAGAAGTCAACATGAGATGGAAT 

GGAAGAGATTGCTGAGCAGCTACAGTCCAGCAAGGCCCTACTT 

TCCAAACAGTGTGCTTCGACAGTTCAGCAGCAGGAGGATGGAACCAATGAGCT 

ACAAGGACATTGCCX3ATGATGAGGITGCCACATGGATTCAAGATT 

(^GAGTTAAAGATTCCCTCTTTGTTCTCCA^ 

GCAGCATCAGCTATTCAATCGGATCAACTCTCTTTGAGTCAACACTTGTGTGCCCTGGAGCAAGCT 
GCAAACAGCAGACTTCATTACAGGCTGGAGTTCTTGATTATG^^ 

GGAGGCCTGGATAGTGGAAGCTGAAGAAATACTACAAGGGCAGGACCCTAGCCACTCATCTGACCT 
ACAATCCAGGAAAGGATGGAAGAACTTAAGGGAGAGATGT^ 

ACCX5TCTAAATGAGCTTGGATATAGGTTACCCTTGAATGATAAGGAAATCAAAAGAATGCAGAATCT 

CCGCCATTGGTCTCTGATCTCCTCTCAGACT^ 

CATCAGACTTTCTTGGAAAAATGTGAAACATGGATG^ 

TAGAGATTTCAGGAAATTATCAGCACCTTTTGGAACAGCAGAGAGCACACGAGOT 
GTTCAGTCGTCAGCAGATTTTGCACTGAA/TC^ 

GATGACAGGGATGAATTCAACCTGAAATTGACACTCCTCAGTAATCAATGGCAGGGAGTGATTCGCAGGG 

CCCAGCAGAGGCGGGGGATC^TTGACAGCCAGATTCGCCAGTGGCAGCGCTATAGGGAGATGGCAGAAAA 

GCTTCGTAAATGGTTGGTTGAAGTGTCCTACCTCCCCATGAGTGGTCTCGGAAGTGTTCCTATACCACTG 

CAACAAGC^GGACCCTCTTTGATGAAGTGCAGTTCAAAGAAAAAGTC^ 

ACATCCTGACTGTGGAGGCTGGCAAGCAACTCCTTCTCT 

GGCCGAACTCGCTGAAATCCAAGAGAAATGGAAATCAGCCAGCATGCGGCTGGAAGAACAGAAGAAAAAA 
CTAGCCTTCTTGTTGAAAGACTGGGAAAAATGTGAGAAAGGAATAGCAGATTCCCTGGAGAAACTACGAA 
CTTTCAAAAAGAAGCTTTCGC^GTCTCTCCCGGATCACCATGAAGAGCTCCATGCAGAACAAATGCGTTG 

caaggaattagaaaatgcagttgggagctggacagatc 

ctctctgcctatatcagtgctgatgatatctccattcttaatgaacgcgtagagcttctgcaaaggcagt 

gggaagaactatgccaccagctctccttaaggcggcagcaaataggtgaaagattgaatgaatggggagt 

cttcagtgaaaagaacaaggaactctgtgagtggttgactcaaatggaaagcaaagtttctcagaatgga 

gacattctcattgaagaaatgatagagaagctcaagaaggattatcaagaggaaattgct 

agaac^aaatac^gctccaac^aatgggagaacgacttc^ 

gattgaatacaagctgggaaaggtcaacgaccggtggcagcatctcctggacctcattgcagccagggtg 

aagaagctgaaggagacgctggtagccgtgcagcagcttgataagaacatgagcagcctgaggacctggc 

tcgctcacatcgagtcagagctggccaagccaatagtctacgattcctgtaactcggaagaaatacagag 

aaagcttaatgagcagcaggagcttcagagagacatagagaagcacagtacaggtgttgcatctgtcctc 

aacctgtgtgaagtcctgctgcacgactgtgacgcctgtgccactgatgccgagtgtgactctatacagc 

aggctacgagaaacctggaccggcggtggagaaacatttgtgctatgtccatggaaaggaggctgaaaat 

cgaagagacgtggcgattgtggcagaaatttctggatgactattcacgttttgaagattggctgaagtct 

tcagaaaggacagctgcttttcccagctcttctggggtgatctatacagttgccaaggaagaactaaaga 

aatttgaggctttccagcgacaggtccacgagtgcctgacgcagctggaactgatcaacaagcagtaccg 

ccgcctggccagggagaaccgcactgattcagcatgtagcctcaaacagatggttcacgaaggcaaccag 

agatgggacaacctgcaaaagcgtgtgacctccatcttgcgcagactcaagcattttattggc 

aggagtttgagactgcgcgggacagcattctggtctggctcacagagatggatctgcagctcactaatat 

tgaacatttttctgagtgtgatgttcaagctaaaataaagcaactcaaggccttccagcaggaaatttca 

ctgaaccacaataagattgagcagataattgcccaaggagaacagctgatagaaaagagtgagcccttgg 

atgcagcgatcatcgaggaggaactagatgagctccgacggtactgccaggaggtcttcgggcgtgtgga 

aagataccataagaaactgatccgcctgcctctcccagacgatgagcacgacctctcagacagggagctg 

gagctggaagactctgcagctctgtcggacctgcactggcacgaccgctctgcagacagcctgctttctc 

"cacagccttcctccaatctctccctctcgctcgctcagcccctccggagcgagcggtcaggacgagacac 

cccagctagtgtggactccatccccctggagtgggatcacgactatgacctcagtcgggacctggagtct 

gcaatgtccagagctctgccctctgaggatgaagaaggtcaggatgacaaagatttctacctccggggag 

ctgttgccttatcaggggaccacagtgccctagagtcacagatccgaca^ctgggcaaagccctggatga 

tagccgctttcagatacagcaaaccgaaaatatcattcgcagcaaaactcccacggggccggagctagac 

accagctacaaaggctacatgaaactgctgggcgaatgcagtaggagtatagactccgtgaagagactgg 

agcacaaactgaaggaggaagaggagagccttcctggctttgttaacctgcatagtaccgaaacccaaac 

ggctggtgtgattgaccgatgggagcttctccaggcccaggcattgagcaaggagttgaggatgaagcag 

aacctccagaagtggcagcagtttaactcagacttgaacagcatctgggcctggctgggggacacggagg 

aggagttggaacagctccagcgtctggaactcagcactgacatccagaccatcgagctccagatcaaaaa 

gctcaaggagctccagaaagctgtggaccaccgcaaagccatcatcctctccatcaa.tctotgca 

gagttcacccaggctgacagcaaggagagccgggacctgcaggatcgcttgtcgcagatgaatgggcgct 

gggaccgagtgtgctctctgctggaggagtggcggggcctgctgcaggatgccctgatgcagtgccaggg 

tttccatgaaatgagccatggtttggttcttatgctggagaacattgacagaaggaaaaatgaaattgtc 

cctattgattctaaccttgatgcagagatacttcaggaccatcacaaacagcttatgcaaataaagcatg 
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agctgttggaatcccaactcagagtagcctcttt 

aggaacagactgtttagaagcc^aagaaaaagtccatgs^ 
gaggtcagtcgtcatatcaaggaa.ctggagaagttattagacgtgtcaa 

cctggtcttctgctgatgaactggacacctcagggtctgtgagt 

CAGACAGAAAACGCCACXxAXjGCAAGTGTAGTCTCT 

AGGTCGA£AAAAGGTGGCTCCGATTCCTCCCTTTCTC 

TGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCAGCTTCTCCTC 

TGTACCAATCTCAGAGGAAGACTAGAGCTGTGCCCTCTCCAACAACTTTGCC 

CTCAGATACACGAATGGCCCTCCTCCACTCTGAACTAAGCAGAT^ 

AAGGAGGATCGGGTCATAAGCAATCCCAAACTACCAACAAGAGGACCTTGA 

TGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGCAAAT^ 

TGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGAAATCTGG^ 

AAGGAGATTGAGTAAGGACCTCGAAAGTTCCCCGGACTCA^ 

ACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCTTTCGCATGGT^ 

CCCTCCCAAGCACCATGTCAGTGTCGTACAATCTACCAACCAACCAGTGCTGAAGAGATTTTAGAACCT^ 
GTAACATACAATTTTTAAGAGCTTATATGGCAGCTTCCTTTTTACCTTGTTTTCCTTTGGGG 
TTTAACCTTTGCTTTAGAAGCACAAGCTGTAAATCTAAAAGGCACTTTTTTTTAGAGGTATAAAG 
CTAGATGTAATAAATAAGATCATGGAAGGCTTTATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNE1 mRNA sequence - varl8 (public gi: 28195688) (sbq id no: 200) 

TGTTCTCACGAGGGGGCCAGCTTGGGGCTTGACTGAGCAGGAGCTTC 

ATGTGACCTCTGCACATTGTTTACAGTTCCrAAACTGTGATTTCTTTTTCTGT^ 

AAGTGGCTACCAAGTAGAAAGTGGTCATGGGGGGTGAAGGTTAAACACAATAAC 

CACAGGGCATTTTATGCCAAGCTATATTGAATATCT^ 

TTGACAATTCACTCTAGACCCTGCTAGAAGAATCC^U^GAAATTC 

GAAATTACAGTACCTCACTAGCGTGTACTGTAC31GAAAAAA 

GAGACTGAGGAGTTGCGACAGATGATCAAAATTCGTTTGCAGAACCTCCAAGATGCAGCTAAGGATATGA 
AAAAATTTGAAGCAGAGTTGAAAAAGTTACAAGCTGCCTTGGAGC^ 

AGAAGTTGGACGTCTCAGTCTCAAGGAGGAGCTCTCTCATCGGCAGCATTTGTTGTCTGAGATGGAGTCA 
CTGAAGCCGAAGGTGCAAGCAGTGCAGCTCTGCCAGAGTGCCCTCCGGATCCCCGAGGATGTGGTTGCC^ 
GCTTACCTCTCTGTCATGCTGCTCTGCGGCTGCAGGAAGAGGCC^ 
GCAGTGTAACATCATGCAGGAAGCTGTGGTACAATATGAACAATATG^ 

CAACTGATAGAAGGAGCTCACAGAGAGATTGAGGATAAACCTGTTGCCACCAGTAACATACAGGAGCTGC 

AGGCTCAGATTTCTCGGCATGAGGAGCTGGCGCAGAAAATTAAGGGCTACCAGGAGCAGATCGCTTCTTT 

GAATTCCAAGTGCAAGATGCTGACGATGAAAGCCAAGCACGCCACCATGCTGCTGACGGTGACCGAGGTC 

GAGGGGCTGGCGGAAGGGACAGAGGACCTGGATGGGGAGCTCCTCCCCACGCCTTCGGCCCACCCCTCTG 

TGGTCATGATGACTGCAGGTCGCTGTCACACTTTGCTGTCACCGGTCACTGAGGAGTCTGGGGAGGAGGG 

AACCAACAGTGAGATTTCCTCTCCACCTGCCTGTCGCTCCCCTTCACCTGTGGCTAATACAGATGCTTCT 

GTTAACCAGGACATTGCATATTACCAAGCCTTGTCTGCTGAGAGGTTGCAGACAGATGCTGCAA/^AAOT 

ACCCCAGCACATCCGCATCCCAGGAGTTCTATGAACCGGGATTGGAGCCATCCGCTACTGCCAAACTGGG 

TGATTTGCAGCGTTCTTGGGAAACCTTAAAGAATGTGATCAGTGAGAAGCAGCGCACACTCTATGAAGCT 

TTGGAGCGCCAGCAGAAGTACCAGGACTCCCTCCAGTCCATCTCTACGAAGATGGAGGCC^ 

AACTCAGTGAGAGCCCAGAGCCTGGCAGGAGTCCAGAAAGCCAGATGGCTGAACATCAGGCATTGATGGA 

TGAGATTCTCATGCTCCAGGATGAAATCAATGAGCTCCAGTCCTCTCTCGCAGAGGAGCTGGTATCCGAG 

TCTTGTGAGGCCGACCCTGCGGAGCAGCTGGCCTTGCAGTCCACGCTCACTGTCTTAGCCGAGCGAATGT 

CCACCATCAGGATGAAAGCCTCGGGGAAACGGCAGCTTTTGGAGGAGAAGTTGAATGATCAGCTGGAGGA 

ACAAAGGCAGGAACAGGCCCTGCAGAGGTATCGCTGTGAAGCCGATGAGCTGGACAGCTGGCTCTTGAGT 

ACCAAGGCC^CTCTGGACACTGCGCTGAGTCCACCCAAGGAGCCCATGGACATGGAGGCCGAGCTTATGG 

ACTGCCAGAATATGCTGGTGGAAATAGAGCAGAAGGTGGTGGCTTTATCAGAACTGTCAGTCCACAATGA 

GAACCTGCTGCTGGAGGGCAAAGCTCACACCAAGGACGAGGCCGAGCAGCTGGCTGGAAAGCTGAGAAGG 

CTCAAGGGGAGCCTGCTGGAGCTGCAGAGAGCCCTGCATGATAAGCAGCTCAACATGCAGGGAACAGCAC 

AGGAGAAGGAGGAGAGCGATGTTGACCTAACAGCCACGCAGAGCCCCGGCGTCCAGGAATGGCTGGCCCA 

AGCTCGCACCACATGGACCCAGCAGCGGCAGAGCAGTCTCC^GCAACAAAAAGAGTTAGAACAGGAATTA 

GCCGAGCAGAAGAGTCTCCTTCGCTCAGTAGCCAGTCGTGGAGAGGAGATTCTAATTCAACATTCGGCGG 

CAGAGACCTCTGGTGATGCTGGCGAAAAACCTGATGTGTTATCCCAGGAGTTGGGGATGGAAGGGGAGAA 

ATCATCCGCTGAAGACCAGATGAGAATGAAATGGGAAAGCCTACATG^ 

CTACTACAGAATGTTCTGGAACAGGAACAAGAGCAAGTGCTTTATAGCAGGCCAAATCGACTCTTGTCTG 

GTGTGCCACTGTACAAAGGGGACGTGCCAACCCAAGATAAATCTGCAGTTACATCTTTGCTGGATGGACT 

GAACCAAGCCTTCGAGGAGGTTTCATCCCAGAGTGGAGGGGCAT^AGAGGCAGAGTATACACTTGGAGCAG 

AAGTTGTATGATGGAGTCTCAGCCACCTCTACTTGGTTGGATGACGTTGAAGAACGTTTATTTGTTGCCA 

CAGCACTTTTACCAGAAGAAACAGAGACTTGTCTCTOCAACCAAGAGATTCTTGCC^ 

AATGTCTGAAGAAATGGATAAGAACAAAAACTTGTTTTCCCAAGCTTTTCCAGAGAATGGTGATAATCGA 

GATGTTATTGAAGATAOTTTGGGTTGTCTTTTGGGCAG 



Figure 36 part - 113 



158/202 



WO 2004/078130 PCT/US2004/006308 

GATGTC^TCAGATGAAAGAAAGACTTCAGCAAATA^ 

ATCACTGGCTGACAACAAATACATCATTCTGCAAAAACTGGCAAATGTO 

CAAATAGAGGCAATACAACAGGCTGAAGATGGACTCAAAGAATTO 

GGCGTGGTGACGAGCTACAGGTCGAGGAGCCGTCCATGCAAGAACTCTCC 

TGAGCTGATGATGATCATTGGCTCCCX3GAGGAGTGGTCTGAAT 

GAGAGGGC^CTACAAGATCTGGCTGACCTGCTAGAAACTGGTCAG 

TCATCGTGTCTTCCAAAGAGGAAAT CCAGCAACCACTTGAC 

^VTCTCATATGATCTTGACTGTAACACTCTTCAGAAAGA 

TTCCATACAGAGCTGATGGCTCAGGCTTCTGCTGTACTGAAACGGGCTCACAAG 

AGTACATTCTAGAGACGTGGTCCCATCTGGATGAGGACCAGCAGGAGCTCAGCAGACAGCT^ 

GGAAAGCAGCATCCCAAGCGTGGGTCTGGTGGAGGAGAACGAGGACAGGCTTATTGACC^CATAACAC 

TACCAGCATTTAAAATCTAGCXJTTAATGAATACCA 
GACTTCTGATATCCATCAGCTGCTCAGATCTAGAAAGCC^^ 

TAACACCAATAAAATGTCTAAGGAACTTCACAGACTGGAAACAATATTGAAACACT 

AGTGAATCTGCAGATCTAATTCACTGGTTACAATCTGCAAAAGACCGGCT 

CTGTGACAGTCCCACAAGAGCTGGAAATGGTCCGTGATCATCTAAATGCTTTCCTGGAGTTTT 

AGTGGATGCCC^TCTTCCCTGAAATCATCTGTTCTGAGTACTGGAAATCAGCTCOT 

GTGGACACAGCCACGCTGCGCTCTGAGTTGTCGCGCATTGATAGC^ 

TCCCAGCCGTCCAGGAGAAGCTCCACCAGCTCCAGATG 

AGTGATGAGTTGGACTTCTCTAATGGAAAATGCTATTCAGAAGGATGAAGATAATATTAAAAATTCCATA 
GGTTACAAGGCAATTCATGAATACCTTCAGAAATATAAGGGTTTTAAGATAGACATTAA 
TGACAGTGGATTTTGTGAACCAGTCCGTGCTACAAATCAGCAGTCAGGATGTGGAAAGTAAGCGTAG 
TAAGACTGATTTTGCTGAGCAACTTGGAGCAATGAATAAAAGTTGG 

GAGAAGATCCAGCTGTTGGAAGGCTTATTGGAATCTTGGTCAGAATATGAAAATAATGTACAATGTCTGA 

AAACATGGTTTGAAACCCAGGAAAAGAGACTAAAAC^CAGCATCGAATTGGAGATCAGGCTTCTGTT^ 

AAATGCACTGAAAGACTGTCAGGATCTGGAAGATCTGATTAAAGCAT^AAGATAAAGAAGTAGAGAAAATT 

GAGCAGAATGGACTTGCTTTGATTC^GACCAAGAAAGAAGACGTCTCTAGmTTGTCATGAGCACACTGC 

GAGAGCTCGGCCAAACCTGGGCAAATTTAGATCACATGGTTGGACAATTAAAGATACTGCTGAAATCAGT 

GCTTGACCAATGGAGTAGTC^CAAAGTGGCCTTTGACAAGATAAACTVGTTACCTCATGGA 

TCTCTTTCCCGATTCCGTCTGCTGACTGGCTCCTTAGAAGCTGTGCAAGTTCAGGTGGACAATCT 

ATCTCCAAGATGATCTGGAAAAAGAGGAAAGGAGCT 

AAAAGAGTGTCACCCAC C CGTGACAGAAACTCTTACCAATACACTGAAAGAAGTCAACATGAGATGGAAT 

AACTTGCTGGAAGAGATTGCTGAGCAGCTACAGTCCAGCAAGGCCCTACTTC^ 

AGGACTACTCC^AACAGTGTGCTTCGACAGTTCAGCAGCAGG 

AGCCACAAACAAGGACATTGCCGATGATGAGGTTGC(^CATGGATTCAAGATTGCAACGACCTCCTC^ 
GGACTGGGCACAGTTAAAGATTCCCTCTTTGTTCTCCATGAGCTGGGAGAGCAACTGAAGCAACAAGTGG 

ATGCTTCCGCAGCATGAGCTATTCAATCGGAT 

AGCTCTCTGCAAACAGCAGACTTCATTACAGGCTGGAGTTCTTGATTATGAAACCTTTGCCAAGAGTTTA 
GAAGCTTTGGAGGCCTGGATAGTGGAAGCTGAAGAAATACTACAAGGGCAGGACCCTAGCCACTCATCTG 
ACCTCTCCACAATCCAGGAAAGGATGGAAGAACTTAAGGGACAGATGTTAAAATT 

AGATTTAGACCGTCTAAATGAGCTTGGATATAGGTTACCCTTGAATGATAAGGAAATCAAAAGAATGCAG 
AATCTGAACCGCCATTGGTCTCTGATCTCCTCTCAGACTACAGAAAGATTC^GCAAGTTGCAGTC^TTTT 
TGCTACAACATC^GACTTTCTTGGAAAAATGTGAAACATGGATGGAATTCCTAGTTCAGACAGAACAA 

GTTAGCAGTAGAGATTTCAGGAAATTATCAGCACCTTTTGGAACA^ 
GCCGAGATGTTCAGTCGTCAGCAGATTTTGCACTCAATCATTATTGATGGGGAAC 

GTCAAGTTGATGACAGGGATGAATTCAACCTGAAATTGACACTCCTCAGTAATCAATGGCAGGGAGTGAT 
TCGCAGGGCCCAGCAGAGGCGGGGGATCATTGACAGCCAGATTCGCCAGTGGCAGCGCTATAGGGAGATG 
GCAGAAAAGCTTCGTAAATGGTTGGTTGAAGTGTCCTACCTCCCCATGAGTGGTCTCGGAAGTGTTCCTA 
TACCACTGCAACZAAGCAAGGACCCrCTTTGATGAAGTGCAGTTCAAAG 

AGGCAGCTACATCCTGACTGTGGAGGCTGGCAAGCAACTCCTTCTCTCGGC - 
GCCTTGCAGGCCGAACTCGCTGAAATCCAAGAGAAATGGAAATCAGCCAGCATGCGGGTGGAAGAACAGA 
AGA7\AAAACTAGCCTTCTTGTTGAAAGACTGGGAAAAATGTGAGAAAGGAATAGCAGATTCCCTGGAGAA 
ACTACGAACTTTCAAAAAGAAGCTTTCGCAGTCTCTCCCGGATC^ 

ATGCGTTGCAAGGAATTAGAAAATGCAGTTGGGAGCTGGACAGATGACTTGACCCAGTTGAGCCTGCTGA 
AGGACACCCTCTCTGCCTATATCAGTGCTGATGATATCTCCATTCTTAATGAACGCGTAGAGCTTCTGCA 
AAGGCAGTGGG AAGAACTATG CCAC CAGCTCTCCTTAAGGCGG CAGCAAATAGGTGAAAGATTGAATGAA 
TGGGC&GTCOTCAGTGAAAAGAACAAGGAACTCTGTGAGTGGTTGACT 

AGAATGGAGACATTCTCATTGAAGAAATGATAGAGAAGCTCAAGAAGGATTATCAAGAGGAAATTGCTAT 
TGCTCAAGAGAACAAAATACAGCTCCAACAAATGGGAGAACGACTTG 

GCATCTGAGATTGAATACAAGCTGGGAAAGGTCAACGACCGGTGGCAGCATCTCCTGGACCTCATTGCAG 
CCAGGGTGAAGAAGCTGAAGGAGACCCTGGTAGCCGTGCAGCAGCTTGATAAGAACATGAGCAGCCTGAG 
GACCTGGCT CGCTCACATCGAGTCAGAGCTG GCCAAGCCAATAGTCTACGATTCCTGTAACTCGGAAGAA 
ATACAGAGAAAGCTTAATGAGCAGCAGGAGCTTCAGAGAGACATAGAGAAGCACAGTACAGGTGTTGCAT 
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rTGTCCTCAACCTGTGTGAAOTC 

TAT ACAGCAGGCTACGAGAAACCTGGACCGGCGGTGGAGAAAC^ CATGGAAAGGAGG 
OTGAAAATCGAAGAGACGTGGCGATTGTGGCAGAAATTT CTGGATGACTATTCACGTTTTGAAGATTGGC 
TGAAGTCTTCAGAAAGGACAGCTGCTm 

CAGTACCGCCGCCTGGCCAGGGAGAACCGCACTGATTCAGCATGTAGCCTCA 

GCAACCAGAGATGGGACA&CCTGCAAAAGCGTGTCACCTCCATCTTGC 

CCAGCGTGAGGAGTTTGAGACTGCGCGGGACAGCATTCTGGTCTGGCTCACAGAG 

ACTAATATTGAACATTTTTCTGAGTGTGATGTTCAAGCTAAAATAAA 

AAATTTCACTGAACC^CAATAAGATTGAGCAGATAATTGCCCAAGGAGAACA 

GCCCTTGGATGCAGCGATCATCGAGGAGGAACTAGATGAG 

CGTGTGGAAAGATACCATAAGAAACTGA^ 

GGGAGCTGGAGCTGGAAGACTCTGCAGCTCTGTCGGACCTGCACTGG 

GCTTTCTCCACAGCCTTCCTCCAATCTCTCCCTCTCGCTCGCTCAGC 

CGAGACACCCCAGCTAGTGTGGACTCCATCCCCCTGGAGTGGGATCACGACTATGACCTCA 

TGGAGTCTGCAATGT CCAGAGCTCTG CCCTCTGAGGATGAA.GAAGGTCAGGATGACAAAGATTTCTACCT 

CCGGGGAGCTGTTGCCTTATCAGGGGACCACAGTGCCCTAGAGTCACAGATCCGACAACTGGGCAAAGCC 

CTGGATGATAGC03CTTT<^GATACAGCAAACCGAAAATATCATTCGCAGCAAA^ 

AGCTAGACACCAGCTACAAAGGCTACATGAAACTGCTGGGCGAATG^^ 

GAGACTGGAGCACAAACTGAAGGAGGAAGAGGAGAGCCTTCCTGGCTTTGTTAACCTGCATAGTACCGAA 

ACCCAAACGGCTGGTGTGATTGACCGATGGGAGCTTCTCCAGGCCCAGGCATTGAGCAAGGAGTTGAGGA 

TGAAGCAGAAGCT CCAGAAGTGGCAGCAGTTTAACT CAGACTTGAACAGCATCTGGG CCTGGCTGGGGGA 

CACGGAGGAGGAGTTGGAACAGCTCCAGCGTCTGGAACTCAGCACTGACATCCAGACCATCGAGCTCCAG 

ATCAAAAAGCTCAAGGAGCTCCAGAAAGCrrGTGGACCACCGCAAAGCCATCATCCTCTCCATCAATCT 

GCAGCCCTGAGTTCACCCAGGCTGACAGCAAGGAGAGCCGGGACCTGCAGGATCGCTTGTCGCAGATGAA 

TGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGCGGGGCCTGCTGCAGGATGCCCTGATGCAG 

TGCCAGGGTTTCCATGAAATGAGCCATGGTTTGCTTCTTATGCTGGAGAACATTGACAGAAGGAAAAATG 

AAATTGTCCCTATTGATTCTAACCTTGATGCAGAGATACTTCAGG 

AAAGCATGAGCTGTTGGAATCCCAACTCAGAGTAGCCTCTTTGCAAGACATGTCTTGCCAACTACTGGTG 
AATGCTGAAGGAACAGACTGTTTAGAAGCCAAAGAAAAAGTCCATGTTATTGGAAATCGGCTCAAACTTC 
TCTTGAAGGAGGTCAGTCGTCATATCAAGGAACTGGAGAAGTTATTAGACGTGTCAAGTAGTCAGCAGGA 
TTTGTCTTCCTGGTCTTCTGCTGATGAACT.GGACACCTCAGGGTCTGTGAGTCCCACATCAGGAAGGAGC 
ACCCCAAACAGACAGAAAACGCCACGAGGCAAGTGTAGTCTCTCACAGCCTGGACCCTCTGTCAGCAGTC 
CACATAGCAGGTCCACAAAAGGTGGCTCCGATTCCTCCCTTTCTGAGCCAGGGCCAGGTCGGTCCGGCCG 
CGGCTTCCTGTTCAGAGTCCTCCGAGCAGCTCTTCCCCTTCAGCTTCTCCTGCTCCTCCTCATCGGGCTT 
GCCTGCCTTGTACC^TGTCAGAGGAAGACTACAGCTGTGCCCTCTCCAACAACTTTGCCCGGTCATTCC 
ACCCC^TGCTCAGATACACGAATGGCCCTCCTCCACTGTGAACTAAGC^GATGCC^TCTGC^GAAGTGCT 
GGTAGCATAAGGAGGATCGGGTCATAAGCAATCCCAAACTACCAACAAGAGGACCTTGATCTTGGCGAAA 
GCCCTCGGTGTGGCAGCTTTAGCCCTCCTCCAGATCACATGTGTGCAAATTATGGCTTCAGAGGTGGAAG 
ATAAACAGTGACGGGGGAACAAACAGACAACAAGAAGGTTTGGAAGAAATCTGGTTTGAGACTCTGAACC 
TTAGCACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCCGGACTCATGAATTCTGGGCCCTTGGCCCA 
TTCTGTGCACAGCCAAGGACTTCAGTAGACCATCTGGGCAGCTTTCCCATGGTGCTGCTCCAACCATCAG 
ATAAATGACCCTCCCAAGCACCATGTCAGTGTCGTACAATCTACCAACCAACCAGTGCTGAAGAGATTTT 
AGAACCTTGTAACATACAATTTTTAAGAGCTTATATGGCAGCTTCCTTTTTACCTTGTTTTCCTTTGGGG 
CATGATGTTTTAACCTTTGCTTTAGAAGCA.GAAGCTGTAAATCTAAAAGGCACTTTTTTTTAGAGGTATA 
AAGAAAAACTAGATGTAATAAATAAGATCATGGAAGGCTTTATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNE1 niRNA sequence - varl9 (public git 28195676) (seq id noi 201) 

CAAGGGGAAACTTTCATCCCCACGCAGGTTATAGCTTTTTGCTCTGCAGAGTCTAACTTTTGCAAGTGGA 
AGCTTCATGGTGGTGGCGGAGGACCTGAGTGCCCTGAGGATGGCAGAGGACGGCTGTGTGGATGCAGATC 
TCCCAGATTGTAACTGCGATGTCACAAGGGCCAGGGTGAAGAAGCTGAAGGAGACCCTGGTAGCCGTGCA 
GCAGCTTGATAAGAACATGAGCAGCCTGAGGACCTGGCTCGCTCACATCGAGTCAGAGCTGGCCAAGCCA 
ATAGTCTACGATTCCTGTAACT^GGAAGAAATACAGAGAAAGCTT^ 

ACATAGAGAAGCACAGTACAGGTGTTGCATCTGTCCTCAACCTGTGTGAAGTCCTGCTGCACGACTGTGA 

CGCCTGTGCCACTGATGCCGAGTGTGACTCTATACAGCAGGCTACGAGAAACCTGGACCGGCGGTGGAGA 

AACATTTGTGCTATGTCCATGGAAAGGAGGCTGAAAATCGAAGAGACGTGGCGATTGTGGCAGAAATTTC 

TGGATGACTATTCACGTTTTGAAGATTGGCTGAAGTCTTCAGAAAGGACAGCTGCTTTTC 

TGGGGTGATCTATACAGTTGCCAAGGAAGAACTAAAGAAATTTGAGGCTTTCCAGCGACAGGTCCACGAG 

TGCCTGACGCAGCTGGAACTGATCAACAAGCAGTACCGCCGCCTGGCCAGGGAGAACCGCACTGATTCAG 

CATGTAGCCTCAAACTIGATGGTTCACGAAGGC^CCAGAGATGGGACAACCTGCAAAAGCGTGTCACCTC 

CATCTTGCGCAGACTCAAGCATTTTATTGGCCAGCGTGAGGAGTTTGAGACTGCGCGGGACAGCATTCTG 

GTCTGGCTCACAGAGATGGATCTGCAGCTCACTAATATTGAACATTTTTCTGAGTGTGATGTTCAAGCTA 

AAATAAAGC^CTC^GGCCTTCCAGCAGGAAATTTCACTGAACCAC^ATAAGATTGAGCAGATAATTGC 
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CCAAGGAGAACAGCTGATAGAAAAGAGT 

CTCCGACGGTACTGCCAGGAGGTCTTCGGGCGTGTGGAAAGATACCATAAG 

TCCCAGACGATGAGCACGACCTCTCAGACAGGGAGCTGGAGCTGGAAG 

GCACTGGCACGACCGCTCnXSCAGAC^GCCrGCTTTCT 

GCTCAGCCCCTCCX3GAGCGAGCX3GTC^GGACGAGACACCCCAGCTA 

GGGATGA.CGACTATGACCTCAGTCGGGACCTGGAGTCTGCAATGTCC^ 

AGAAGGTCAGGATGACAAAGATTTCTACCTCQ3GGGAGCT 

AGCCCTGAGGCCTATGTAAAACTCACAGAAAATGCAATCAAAAATACCTCCGGGGACCACAGTC 

AGTCACAGATCCGACAACTGGGCAAAGCCCTGGATGATAGCCGCTT^ 

CATTCXXIAGCAAAACTCCCACGGGGCCGGAGCTAGAC^ 

GAATGCAGTAGCAGTATAGACTCCGTGAAGAGACTGGAGCA.CAAACT 

CTGGCTTTGTTAACCTGCATAGTACCGAAACCCAAACGGCTGGTGTGATTGACCG 

GGCCCAGGCATTGAGCAAGGAGTTGAGGATGAAGCAGAACCTC 

TTGAA^GCATCTGGGCCTGGCTGGGGGACACGGAGGAGGAGTT^ 

GCACTGACAT CCAGACCATCGAGCTCCAGATCAAAAAGCTCAAGGAGCT C CAGAAAGCTGTGGACCACCG 
CAAAGCCATCATCCTCTCCATCAATCTCTGCAGC^ 

GACCTGCAGGATCGCTTGTCGCAGATGAATGGGCGCTGGGACCGAGTGTGCTCTCTGCTGGAGGAGTGGC 
GGGGCCTGCTGCAGGATGCCCTGATGC^ 

GCTGGAGAACATTGACAGAAGGAAAAATGAAATTGTCCCTATTGATTCTAACCTTGATGCAGAGATACT 
C^GGACCATC^CAAACAGCTTATGCAAATAAAGCATGAGC 

TGCAAGACATGTCTTGCCAACTACTGGTGAATGCTGAAGGAACAGACTGTTTAGAAGCCAAAGA 
CCATGTTATTGGAAATCGGCTCAAACTTCTCTTGAAGGAGGTCAGTCGTCATATCAAGGAACTGGAGAAG 
TTATTAGACGTGTCAAGTAGTCAGCAGGATTTGTCTTCCTGGTCTTCTGCTGATGAACTGGACACCTCAG 
GGTCTGTGAGTCCCACATCAGGAAGGAGCACCCCA^^ 

CT^CAGCCTGGACCCTCTGTCAGCAGTCCACATAGCAGGTCCACAAAAGGTGGCTCCGATTCCTCCCTT 
TCTGAGCCAGGGCCAGGTCGGTCCGGCCGCGGCTTCCTC 

AGCTTCTCCTGCTCCTCCTCATCGGGCTTGCCTGCCTTGTACCAATGTCAGAGG 

CCTCTCCAACAACTTTGCCCGGTCATTCCACCCCATGCTCAGATACACGAATGGCCCT 

ACTAAGCAGATGCCATCTGCAGAAGTGCTGGTAGCATAAGGAGGATCGGGTCATAAGCAATCCCAAACTA 

CCAACAAGAGGACCTTGATCTTGGCGAAAGCCCTCGGTGTGGCAGCTTTAGCCCTCCTCGAGATCACATO 

TGTGCAAATTATGGCTTCAGAGGTGGAAGATAAAC^GTGACGGGGGAACAAACAGACA 

GGAAGAAATCTGGTTTGAGACTCTGAACCTTAGCACTAAGGAGATTGAGTAAGGACCTCCAAAGTTCCCC 

GGACTCATGAATTOTGGGCCCTTGGCCCATTCTGTGCA.CAGCCAAGGACTTCAGTAGACCATCT 

CTTTCCCATGGTGCTGCTCCAACCATCAGATAAATGACCCTCCCAAGCACCATGTC^GTGTCGTACAATC 

TAC CAAC CAACCAGTG CTG AAG AGATTT T AGAACCTTGT AACATACAATTTT T AAGAG CTT ATATGGCAG 

CCTCCTTTTTACCTTGTTTTCCT^ 

TCTAAAAGGCACTTTTTTTTAGAGGTATAAAGAAAAACTAGATGTAATAAATAAGATCATGGAAGGCTTT 
ATGTGAAAAAAGTTGAATGTTATAGT 

Human SYNE 1 Protein sequence - varl (public gi: 21753085) (seq id no: 295) 

MVVDDLFEDMKDGVKIjIaAIjLE^^ 

RPSIVLGLMWTIILYFQIEEIjTSNLPQLQSLSSSASSTO 

VQYTAGKQTGIEVKDFGKSWRSGVAFHSVIHAIRPELVDLETVKGRSNRENLEDAFTIAETELGIPRIjLD 
PEDVDVDKPDEKS IMT YVAQFLKHYPDIHNASTDGQ 

EQFERDLtTRAQMVESNLQDKYQS FKHFRVQYEMKRKQ I EHL I QPLHRDGKIiSLDQALVKQSWDRVTSRIiF 
DWHI QLDKSI1PAPI1GTI GAWL YRAEVALREE I TVQQVHEETANT I QRKLEQHKDLLQNTDAHKRAFHE I Y 
RTRSVNGI PVPPDQIiEDMAERFHFVSSTSEIiHLMKMEFI^ 

QLIiQNYVS F I ENS KFFEQ YE VT YQ I L KQTAEM WKADGSVEEAENVMKFMNE TT AQWRNLS VEVRS VRSM 
LEEVI SNWDRYGNTVAS LQAWLEDAEKMLNQSENAKKDFFRNIjPHW I QQHTAMND AGNFLI ETCDEMVSR 
DIiKMLLLIiNGRWRELFMEVKQYAQADEMDRMKKEYTDCWTL 
DLEDIEQRVPVMDAQYKI ITKTAHLITKESP 

Human SYlSEl Protein sequence - var2 (public gi: 19584385) (seq id no: 296) 

KLLIQDLEDIEQRVPVMDAQYKIITKTAHLITKESPQEEGKE 

DLI PLEELEKQMTSFYDSLGKINEI ITVIaEREAQSSALFKQKHQELLACQENCKKTLTLIEKGSQSVQKF 

VTLSNVLKHFDQTRLQRQIADIHVAFQSIWKKTGDWKKHVETNSRLM 

KGDPEEIiLRRHTEFFSQLDQRVLNAFLKACDEL^ 

KIDVEKNRFLASVEECRTELDRETKLMPQEGSEKIira 

PVRDTPGTCHVTIiKELRAAIDSTYRKLMEDPDKWKDOT 

VIOUVVEEIRNGVTKRGETLSWLKSRLKVIiTE^^ 

dcvqykeivknsleelisgskevqeqaekildtei^fea 

EGGLPDRGHEEIiRKLESTLDGLERSRERQERRIQVTLRKWERFETNKETVVRYLFQTGSSHERFL^ 
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SSLSSEIJSQTKEFSKRTESIAVQ^ 

FVTTGESARI KAKLTQI RRYGEELREHAQCLSGTI LGHLSQQQKFEENIiRKI QQSVSE ^^^^Y?^?^ 

Latetyk^ehmdlcqalesi^sa^ 

IiLAHWQRIjEKELSSFLTWIjERGEAKASSPEI^ISADRVKVEGEIjp 
IIK 

Human SYNE1 Protein sequence - var3 (public gi: 17861378) <seq id no: 297) 

MGERIjAKASHESKASEIEYKIjGKVNDRWQHLLDLIAARVK^ 
AKPIVYDSCNSEEIQRKLNEQQELQRDIEKHST^ 

RWRNI CAMSMERRLKI EETWRLWQKFLDDYSRFEDWIiKS sertaafpss sgvi ytvakeelkkfeafqrq 
VHECLTQliEIjINKQYRRIiARENRTDSACSLKQMVHEGNQRWDNLQ 

SILVWLTEMDWLTNIEHFSECDVQAKIKQIjKAFQQEISIjNHNKIEQI IAQGEQLIEKSEPLDAAIIEEE 
LDELRRYCQEVFGRVERYHKKLI RLPLPDDEHDLSDRELELEDSAALSDLHWHDRSADSLLS PQPSSNLS 
LSLAOPLRSERSGRDTPASVDSIPLEWDHDYDLS^^ 

IPESPEAYVKLTENAIKNTSGDHSALESQIRQMKAIJDDSRFQIQQTENIIRSKTPTGP^ 

TjLGECSSSIDSVKI^EHKLiKEEEESLPGFVlsrLHST 

NSDLNSIWAWIjGDTEEELEQLQRIiELSTDIQTIEL^ 

ESRDUDDRLSQ^GRWDRVCSLI^ 

EILODHHKQMQIKHELLESQLRVASLQDMSCQLL 

LEKIoIJDVSSSQQDLSSWSSADEIJDTSGSVSPTSGRSTPNRQKTPRGKCSDSQPGPSVSSPHS 

SSLSEPGPGRSGRGFLFRVIJiAALPLQril^IiLIGIaACLVP^ 

PL 

Human SYNE1 Protein sequence - var4 (public gi: i786i386) (seq id no: 298) 

MELDAAVQKFLEQNGQLGKPLAKKIGKLTELHQQT I RQAENRLSKLNQATSHLEE YNEMLEL I LKWI EKA 
KVLAHGTIAWNSASQLRKQYILHQTLLEESKEIDSELEAMT^ 

RQMI KI RLQNLQDAAKDMKKFEAELKKLQAALEQAQATLTS PEVGRLSLKEQLSHRQHLLS EME SLKPKV 
OAVQLCQSAIiRI PEDWASLPLCHAALRLQEEASRIiQHTAI QQCNIMQEAWQYEQYEQEMKHLQQLI EG 
AHREIEDKPVATSNIQELQAQISRHEEIAQKIKGYQEQIASLNSKCKMLTMKAKHATML 
GTEDIJDGELIjPTPSAHPSVVMMTAGRCHTLLSPVTEESGEEGTNSEI ssppacrs PSPVANTDASVNQDI 
AYYQALSAERLQTDAAKIHPSTSASQEFYEPGLEPSATAKLGDLQRSWETLKNVISEKQRTLYEALERQQ 

KYQDSLQSISTKMEAIELKLSESPEPGRSPESQ^HQAL^ 
PAEQLALQSTLTVLAERMSTIRMKASGKRQL 

DTALS PPKEPMDMEAQLMDCQNMLVE I EQKWALSELSVHNENLLLEGKAHTKDEAEQLAGKLRRLKGSL 

LELQRALHDKQLNMQGTAQEKEE SDVDLTATQS PGVQEWLAQARTTWTQQRQSSLQQQKELEQELAEQKS 

LLRSVASRGEEILIQHSAAETSGDAGEKPDVLSQELGMEGEKSSAEDQMRMKMESLHQEFSTKQKLL 

LEQEQEQVLYSRPNRLLSGVPLYKGDVPTQDKSAVTSLLDGLNQAFEEVSSQSGGAKRQSIHLEQKLYDG 

VSATSTWLDDVEERLFVATALLPEETETCLFNQEILAKDIKEMSEEMDKNKNLFSQAFPE 

TLGCLLGRLSLLDSVWQRCTQMKERLQQII^^ 

OOAEDGLKEFDAGI I ELKRRGDELQVEQPSMQELSKLQDMYDEIjMMI IGSRRSGLNQNLTLKSQYERALQ 
DLADLLETGQEKMAGDQKI IVSSKEEIQQPLDKHKEYFQGLESHMILTVTLFRKIISFAVQKETQFHTEL 
MAQASAVLKRAHKRGVELEYILETWSHLDEDQQELSRQLEWESSIPSVGLVEENEDRLIDRITLYQHLK 
SSLNEYQPKLYQVLDDGKRLLISISCSDLESQLNQLGECWLSNTNKMSKELHRLETILKH^ 
LIHWLQSAKDRLEFWTQQSVTVPQELEMVRDHLNAPLEFSKETO 

LRSELSRIDSQWTDLLTNIPAVQEKI.HQLQMDKLPSRHAISEVMSWTSLMENAIQKDEDNIKNS 
HEYLQKYKGFKIDINCKQLTVDFVNQSVLQISSQDVESKRSDKTDFAEQLGAMNKSWQILQGLVTEKIQL 
LEGLLESWSE YENNVQCLKTWFETQEKRLKQQHRI GDQASVQNALKDCQDLEDLI KAKDKEVEKI EQNGL 
ALIQTKKEDVSS I VMSTLRELGQTWANIiDHMVGQIjKI LLKSVLDQWSSHKVAFDKINSYLMEARYSLSRF 
RLLTGSLEAVQVQVDNLQNLQDDLEKQERSLQKFGSI TNQLLKECHP PVTETLTNTLKEVNMRWNNLLEE 
I AEQLQS S KALLQLWQRYKDYSKQCAS TVQQQEDRTNELLKAATNKDI ADDEVATW I QDCNDLLKGLGTV 
KDSLFVLHELGEQLKQQVDASAASAIQSTO^ 

WIVEAEEILQGQDPSHSSDLSTIQERMEELKGQMLKFSSMAPDLDIILNELGYRLPLNDKEIKRMQNIjNIIH 
WSLI SSQTTERFSKLQS FLLQHQTFLEKCETWMEFLVQTEQKLAVEI SGNYQHLLEQQRAHELFQAEMFS 
RQQ I LHS 1 1 IDGQRLLEQGQVDDRDE FNLKLTLLSNQWQGVI RRAQQRRGI IDSQ I RQWQRYREMAEKLR 
KWLVEVSYLPMSGI^SVPIPLQQARTLFDEVQFKEKVFLRQQGSYILTVEAGKQLLLSADSGAEAALQM 

LAEIQEKWKSASMRLEEQKKKLAFLLKDWEKCEKGIADSLEKL 

LElsf AVGSWTDDLTQLSLLKDTLSAYI SADDI SI LNERVELLQRQWEELCHQLSLRRQQIGERLNEWAVFS 

EKNKELCEWLTQMESKVSQNGDILIEEMIEKLKKDYQEEIAIAQENKIQLQQMGERLAKASH 

YKLGKVNDRWQHLLDLIAARVKKLKETLVAVQQLDKNMSSLRTW^ 

NEQQELQRDIEKHSTGVASVLNLCEVLLHDCDACATDAECDS IQQATRKLDRRWRNI CAMSMERRLKI EE 
TWRLWQKFLDDYSRFEDWLKSSERTAAFPSSSGVIYTVAKEEDKKFEAFQRQVHECLTQLELINKQYRRL 
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ARENRTDS ACSLKQMVHEGNQRWDNLQKRVTS I LRRLKHF I GQREEFETARDS I LVWLTEMDLQLTNI EH 
FSECDVQAKI KQLKAFQQEISLNHNKIEQI IAQGEQIjI EKSEPLDAAI IEEELDEIjRRYCQEVFGRVERY 
HKKLIRIjPLPDDEHDLSDRELELEDSAA^ 

SVDSIPLEWDHDYDLSIUDIjESAMSRALPSEDEEGQDDKDFYIjRGAVALSGDHSALESQIM 
FQICXJTENIIRSKTPTGPELOTSYKGYMKLLGECSSSIDSVKRIjEHKIjKEEEE 

VIDRWELMAQALSKEIiRMKQNLQKWQQFNSDI^ 

ELQKAVDHRKAI I LS INLCSTEFTQADSKESRDLQDRliSQMNGRWDRVCSLLEEWRGLLQDALMQCQGFH 
EMSHGIjLIjMIjENIDRRKNE I VP IDSNLDAE I LQDHHKQLMQI KHELrLESQLRVASIiQDMS CQI1L1VNAEGT 
DCLEAKEKTOVIGNRLKLLLKEVSRHIKELEKLIiDVSSSQ 

KTPRGKCSLSQPGPSVSSPHSRSTKGGSDSSLSEPGPGRSGRGFLFRVLRAALPL^ 
MSEEDYSCALSNNFARSFHPMLRYTNGPPPL » 

Human SYNE 1 Protein sequence - var5 (public gi: 17227154) (seq id no: 299) 

MRLEEQKKKLAFLLKDWEKCEKGIADSLEKI^ 

TQLSIiLKDTDS AYISADDI S ILNERVELLQRQWEEIiCHQLSLRRQQI GERLNE WAVFSEKNKELCEWLTQ 

MESKVSQNGDILIEEMlEKIiKKDYQEEIAIAQENKIQLQQMGERIiAKASHESKASEIEYKIiGK^ 

LIJDLIAARVKKLKETLVAVQQIjDKNMSSLRTWI^ 

■HSTGVASVLNLCEVLLHDCDACATDAECDS I QQATRNLDRRWRNI CAMSMERRLKI EETWRIiWQKFLDDY 
SRFEDWLKSSERTAAFPSSSGVIYTVAKEELKICFEAFQRQVHEC^TO^ 

KjQMVHEGNQRWDNLQKRVTS UjRRLKHFIGQREEFETARDS I LVWLTEMDLQLTNIEHFSECDVQAKI KQ 

I1KAFQQEISI1NHNKIEQI IAQGEQLIEKSEPLDAAI IEEELDELRRYCQETVPGRVERYHKKLIRIiPIiPDD 

EHDIiSDRELELEDSAAIiSDIiHWHDRSADSLLSPQPSSNLSLSLAQPLRSERSGRDTPASVDSIPIiEWDro 

YDLSRDLESAMSRALPSEDEEGQDDKDFYIJ*GAVALSGDHSAIiESQI^ 

KTPTGPELDTSYKGYMKLIjGECSSSIDSVKRLEHKLKEEEESL^ 

LSKELRMKQNLQKWQQFNSDLNSIWAWLGDTEEELEQLQR^ 

ILSINLCSPEFTQADSKESRDLQDRLSQMNGRVTORVCSI^^ 

IDRRKNEIVPIDSOT1DAEILQDHHKQLMQIKHELI1ESQLRVASLQDMS 

GNRLKIJjLKEVSRHIKEIiEKIiljDVSSSQQDLSSWSSADELDTSGSVSPTSGRSTPNR 

GPSVSSPHSRSTKGGSDSSLSEPGPGRSGRGFLFRVIiRAALPIjQLLIjLIjLI^ 

NFARSFHPMLRYTNGPPPL . 

Human SYNE1 Protein sequence - var6 (public gi: 12698057) (seq id no: 300) 

QRKLEQHKDLIiQNTDAHKRAFHE I YRTRSVNGI PVPPDQLEDMAERFHFVS STSELHLMKME FLEtiKYRL 

LSLLVLAESKLKSWIIKYGRRESVEQLLQNYVSFIENSKFFEQYEVTYQILKQTAEMYVKATC 

VMKFMNETTAQWRNLSVEVRSVRSMLEEVISN^^ 

WIQQHTAMNDAGNFLIETCDEMVSRDLKQQIiLLLNGRWRELFMEVKQYAQADEMDR^ 

FATE AHKKLSE PLEVS FMNVKLL I QDLED I EQRVPVMDAQ YKI I TKTAHL I TKESPQEEGKEMFATMSKIi 

KEQLTKVKECYSPLLYESQQLLIPLEELEKQMTSFYDSI^KINEIITVLEREAQSSALFKQKHQELLACQ 

ENCKKTLTLIEKGSQSVQKFVTLSNVI^FDQT^ 

EESRAELEKVLRIAQEGLEEKGDPEELIiRRHTEFFSQLDQRVLNAFLKACDELTD 

KLHKQWmLQGEAPYHLLHLKIDVEKNRF 

LCEKRLQLIEELCVKLPVRDPVRDTPGTCHV^^ 

NETQLKGIKGEAIDTANHGEVKRAVEEIRNGVTKRGETLSWL^ 

SFKALVTLLSEVEKMLSNFGDC^QYKEIVKNSLEELISGSKEVQEQAEKILDTENLF 

KRISAKKRDVMQIAQAQC^EGGLPDRGHEELRKLESTLDGLERSRERQERRIQVTLRKWERFETNKETV 

VRYLFQTGSSHERFIiSFSSLESLSSELEQTKEFSKRTESIAVQAENLVKEASEIPLGPQNKQLLQQQAKS 

IKEQVKKLEDTLEEEYVIDKS 

Human SYNE 1 Protein sequence - var7 (public gi; 2895593) (seq id no: 301 

MKQNLQKWQQFNSDLNSIWAWIiGDTEEELEQLQRLELS ILS INL 

CSPEFTQADSKESRDLQDRLSQMNGRWDRVCSLLEEWRGLLQDA^ 

EIVPIDSNLDAEILQDHHKQLMQIKHELLESQLRVASLQDMSC^ 

LLKEVSRHIKELEKLLDVSSSQQDLSSWSSADELDTSGSVSPTSGRSTPNRQKTPRGKCSLSQPGPSVSS 

PHSRSTKGGSDSSLSEPGPGRSGRGFLFRVLRAALPI^LLLLLLIG^^ 

TPCSDTRMAIiLHSELSRCHLQKCW 

Human SYNE I Protein sequence - var8 (public gi: 6330957) (seq id no: 302 
ldlcrqsnnlclqreedlqrtrdyhdcmnvvevflek^ 

kyai edlkdqkqkmi ehlnlddkelvkeqtshleqrwfqledli krki qvsvtnleelnwqsrfqelme 

WAEEQQPNI AEALKQSPPPDMAQNLLMDHIjAI CSEI1EAKQMLI1KSLI KDADRVMADIjGIiNERQVIQKALS 
DAQSHVNCLSDLVGQRRKYLNKALSEKTQFLMAVFQATSQIQQHER^ 
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AQASIJCTYQNEVTGLWAQGRELMKEVTE^^ 

EDFDKACHWLKQMIVTFPEINIjMN^ 

SYLSEKLNAIjPRQFNVTVAIiAKDKFYKVQEAILAR^ 

ALSLQDGCRAILDEVAGLGEAVDELNQKiaSGFR^ 

AYQEHEKMCQQLERQLKSVKEEQSKVl^ETLPAEEKIiKMYH^ 

LAYEKARHQI QS WQGELKIiLTSAI GETVTECESRMVQS IDFQTEMSRSLDWLRRVKAEIjSGPVYTjDIiNLQ 
DIQEEIRKIQIHQEEVQSSIiRIMNALSHKEKEKFTKAKEL^ 

TE I YSQCQRYYQVFQAANDWLEDAQEMIjQIiAGWGLDVESAEENLKSHME FFSTEDQFHSNIjEEIiHS IiVAT 

LDPLIKPTGKEDLEQKVASLELRSQRMSRDSGAQW 

LLKTSSSHEAEEKIiSEHKALVSVVNSFHEKIVAIjEEKASQIjEKTGN^ 

AQDQEKILEDAVDEWTGFNNK\nOCATE^ 

GMLRQQTLSF^QDGAAPTPGEEPPLM 

VQETKEYLGNPTIEIDAQLEELQILLTEATNHRQ 

IRDQIQDKIKEVEQSKATSQELSRQIQKIiAKDLTTILTKLKAKTDNW 

LDAAVQKFLEQNGQLGKPLAKKIGKLTEIiHQQTI RQAENRIiS KLNQAASHLEE YNEMIiELI LKWI EKAKV 
IiAHGT I AWNSASQLREQYILHQVTLGKI I FKK 

Human SYNE1 Protein sequence - var9 {public gi: 20521662) (seq id no: 303 

WISLMENVIQKDEDNIKNSIGYKAIHEYIjQKYKGFKIDINCKQLTVDFW 

FAEQLGAMNKSWQILQGIjWEKIQLLEGLIjESWSEYENOT 

KDCQDLEDLIKAKEKEVEKIEQNGLALIQNKKEDVSSi™ 

WSSHKVAFDKINS YLMEARYSLS RFRIjLTGSIjEAVQVQVDNIjQNIjQDDLEKQERSIjQKFGS ITNQLIiKEC 
HPPVTETLTNTLKEVNMRWNNLiI<EE IAEQLQS SKAIjLQLWQRYKDYSKQCASTVQQQEDRTNEIjLKAATN 
KDIADDEVATWIQDCmLLKGIiGTVKDSLFVLHELGEQLKQQVDASAASAIQSM 
KQQTSLQAGVIiDYETFAKSLEAIiEAWIVEAEEIIiQGQDPSHSSDL^ 
RIjNELGYRIjPLNDKEIKRMONIiNRHWSLISSQTTERFS 

EISGNYQHLLEQQRAHELFQAEMFSRQQIIjHS 1 1 IDGQRLLEQGQVDDRDEFNLKLTLLSNQWQGVIRRA 

qqrrgi idsqirqwqryremaeklrkwlvevsyiipmsglgsvpi plqqartlfdevqfkekvfbrqqgs y 

iltveagkqlllsadsgaeaai^aelaeiqekwksasmriieeqkkk^ 

fkkklsqslpdhheeiihaeqmrckelenavgswto^ 

^elchqlslrrqqigerlnewavfseknkelct^^ 

nkiqlqgmgerlakasheskaseieyklgkwdr^ 

ahieselakpivydscnseeiqrklneqqelqrdiekhstgv^ 

atrnldrrwrnicamsitorrlkieetwrlwqk^^ 

FEAFQRQVHECLTQLELINKQYRRIJUIEI^TDSACSLKQMVHEGNQRWDNIjQKRVTS I LRRLKHFI GQRE 

EFETARDSILVWLTEMDLQLTNIEHFSECDVQAKIKQIiKAFQQEISLNHNK^ 

AAIIEEELDELRRYCQEVFGRVERYHKKLIRIiPLPDDEHDLSDRED^ 

QPSSNLSLSLAQPLRSERSGRJDTPASVDSIPLEWDHDYDLSRDLESAMSRALPSEDEEGQDDKDFYLRGA 
VALSGDHSALESQIRQLGKALDDSRFQIQQTENIIRSKTPTGPELDTSYKGYMKLLGECSSSIDSVKRLE 
HKBKEEEESDPGFVNLHSTETQTAGVIDRWELLQAQALSK^ 
ELEQLQRLELSTDIQTIELQIKKLKELQKAVDHRKM^ 

DRVCSLLEEWRGLIjQDALMQCQGFHEMSHGLDIjMLENIDRRKNEIVPIDSNIjDAEILQDHH 
LLESQLRVASLQDMSCQLLVNAEGTDCLEA^ 

•WSSADELDTSGSVSPTSGRSTPNRQKTPRGKCSIjSQPGPSVSSPHSRSTKGGSDSSLSEPGPGRSGRGFL 
FRVIiRAALPLQLLLIJ^LIGLACLVPMSEEDYSCAIaSNNFARSFHPMLRYTNGPPPL 

Human SYNE1 Protein sequence - varlO {public gi: 28195689) (seq id no: 304 

MTEKIiQYLTSVYCTEKMS QQVAELGRETEEIiRQMI KIRIjQNIjQDAAKDMKKFEAELKKLQAALEQAQATL 
TSPEVGRLSLKEQLSHRQHLIiSEMESLKPKVQAVQLTOSALRIPEDVVASLPLCHAAIjRLQEE 

aiqqcnimqeawqyeqyeqemkhlqqliegahreiedkpvatsniqelqaqisrheelaqkikgyqeqi 
aslnskckmiitmkakhatmlltotevegii^ 

eegtns e i s s ppacrs ps pvantdasvnqdi ayyqals aerlqtdaaki hpstsasqefye pgle ps ata 

kmdlqrswetlkbwisekqrtiiyealerqqkyqd 

iimdeilmlqdeinelqsslaeelvsesc^ 

leeqrqeqalqryrceadeldswllstkatldtalsppkep 

hnenlllegkahtkdeaeqiagklrrlkgs llelqralhdkqlnmqgtaqekeesdto pgvqew 
laqarttwtqqrqsslqqqkeleqelaeqksllrsvasrgeeiliqhsaaetsgdagekpdvlsqelgm^ 
gekssaedqmrmkweslhqefstkqkllqw^ 
dglnqafeevssqsggakrqsihleqklydgvsatstwi^ 

I KEMSEEMDKNKNLFSQAFPENGDNRDV IEDTLGCIiLGRIiS LLDS WNQRCHQMKERIjQQI LNFQNDIiKV 
LFTSLADNKYI ILQKLANVFEQPVAEQIEAI QQAEDGLKEFDAGI IELKRRGDELQVEQPSMQELSKLQD 
MYDEL^I IGSRRSGLNQNLTLKSQYERADQDIiADLIjETGQEKMAGDQKI IVSSKEEIQQPIiDKHKEYFQ 
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GLESHMILTVTLFRKIISFAVQKETQFHTELI^ 

EVVESSIPSVGLVEENEDRLIDRITLYQHLKSSLNEYQPKLYQVI^^ 

WliSNTtfKMSKELHRLETILKH^ 

SKEVDAQSSLKSSVLSTGNQLLRIxKKVDTATLRS 

ISEVMSWTSLMENAIQKDEDNIKNSIGYKAIHEYLQK^ 

RSDKTDFAEQLGAMNTCSWQILQGLVTEKIQI^EGLijESW 

SVQNALKDCX3DLEDLIKAKDKEVEKIEQNGLALIQTK^ 

KSVI^WSSHKVAFDKINSYLMEARYSLSRFRLLTC 

QLLKEmPPVTETLTOTLKEV^mRWNNIiLEEIAEQL 

LKAATNKDIADDEVATWIQDCNDLLKGLGTVKDSLFVLHEL^ 

IiEQALCKQQTSLQAGVIiDYETFAKSIiEAIiEAWIVEAEEILQGQDPSHSSDLSTIQ 

MAPDLDRLNELGYRLPLNDKE I KRMQNLNRHWSLI SSQTTERFS KLQSFLLQHQTFLEKCETWMEFLVQT 
EQKI^VEISGNYQHLLEQQRAHELFQAEMFSRQQILHS 1 1 IDGQRLLEQGQVDDRDEFNLKLTLLSNQWQ 
GTVIRRAQQRRGIIDSQIRQWQRYREMAEKLRKWLVEVSYLPMSGLGSW 
RQQGSYILTVEAGKQLLLSADSGAEAAMAELAEIQEKWKSASMRLE^^ 
LEKLRTFKKKLSQSLPDHHEELHAEQMRCKEL 

LIK3RQWEELCHQLSIiRRQQIGERIiNEWAVFSEKNKELCEWIiTQMESK^ 

IAIAQENKIQLQQMGERLAKASHESKASEIEYKLGKVNDRWQHI^ 

SLRTWLAHIESELAKPIVYDSC^SEEIQRKLNEQQELQRD 

CHDSIQQATRNLDRRWRNICAMSMERRLKIEETWRLWQKFLDDYSR^ 

KEELKXFEAFQRQVHECLTQLELINKQYRRIiARENRTDSACSLKQMVHEGNQRW^ 

FI GQREEFETARDS ILVWLTEMDLQLTNIEHFSECDVQAKI KQLKAFQQEI SLNHNKIEQI IAQGEQLIE 
KSEPIiDAAI I EEELDELRRYCQEVFGRVERYHKKLIRLPLPDDEHDLSDRELELEDSAALSDLHWHDRSA 
DSLLSPQPSSNLSLSIAQPLRSERSGRDTPASVDSIPLEWDHDYDLSRDLESAMSRALPSEDEEGQDD 
FYLRGAVALSGDHSALESQIRQLGKALDDSRFQIQQTENIIRSKTPTGPELDTSYK^^ 
SVKRLEHKLKEEEESLPGFVNLHSTETQTAGVIDRWELLQAQALSKELRMKQ 

LGDTEEELEQLQRLELSTDIQTIELQI KKLKELQKAVDHRKAI ILS INLCS PEFTQADSKESRDLQDRLS 
Ql^GRWDRVCS LLEEWRGLLQDAIjMQCQGFHEMSHGLLIjMLENI drrkne i vpi dsnldaei lqdhhkql 
mqikhellesqlrvasi^dmscqllwaegtdcleaj<ekvhvignrlklllkevsrhik^ 
qqdlsswssadeldtsgsvsptsgrstpnrqktprgkcslsqpgpsvssphsrstkggsdsslsepgpgr 
sgrgflfrvlraalplqlllllli glaclvpms eedys calsnnfars fhpmlrytngp p pl 

Human SYNE 1 Protein sequence - varll (public gi: 2Bi9S6ii) (seq id no: 305) 

MVVAEDLSALRMAEDGCVDADLPDCNCDVTRARVK^ 

YDS CNSEE IQRKLNEQQEDQRD IEKHSTGVASVLNLCEVLLHDCDACATDAECDS I QQATRNLDRRWRNI 
CAMSME RRLKI EETWRLWQKFLDD YSRFEDWLKS S ERTAAF PS S S GVI YTVAKEELKKFEAFQRQVHE CL 
TQLEL INKQYRRLARENRTDS ACS LKQMVHEGNQRWDNLQKRVT S I LRRLKHF I GQREE FETARDS I LVW 
LTEMDLQLTNT EHFSECDVQAKI KQLKAFQQEI SLNHNKIEQI I AQGEQL I EKSEPLDAAI I EEELDELR 
RYCQEVFGRVERYHKKLIRLPLPDDEHDLSDRELELEDSAALSDLHWHDRSADSLLSPQPSSNLSLSLAQ 
PLRSERSGRDTPASVDSIPLEWDHDYDLSRDLESAMSRALPSEDEEGQDDKDFYLRGAVALSDVMIPESP 
EAYVKLTENAI KNTSGDHS ALESQI RQLGKALDDSRFQIQQTENI I RSKTPTGPELDTS YKGYMKLLGEC 
SSSIDSVKRLEHKLKEEEESLPGFVNLHSTETQTAGVIDRWELLQA 

S I WAWLGDTEEELEQLQRLELSTDIQTIELQI KKLKELQKAVDHRKAI ILS INLCS PEFTQADSKESRDL 
QDRLSQMNGRWDRVCSLLEEWRGLLQDALMQCQGFHEI^HGLLLMLENIDR^ I VP IDSNLDAE I LQD 
•IffiKQLMQIKHELLESQLRVASLQDMSCQLLVNAEGTO 

DVSSSQQDLSSWSSADELDTSGSVSPTSGRSTPNRQKTPRGKCSLSQPGPSVSSPHSRSTKGGSDSSLSE 
PGPGRS GRGFLFRVLRAALPLQLLLLLLIGLACLVPMSEEDYS CALSNNFARS FHPMLRYTNGP PPL 

Human. SYNE 1 Protein sequence- varl2 (public gi: 28192628) (seq id no: 3oe) 

MATSRGT^RCPRDIANVMQRLQDEQEIVQKRTFTKWINSHLAKRKPPMVVDDLFEDMK^ 
SGQKLPCEQGRRMKRIHAVANIGTALKFLEGRXIKLWINSTDIADGRPSIVLGLMWTIILYFQIEELTS 
NLPQLQSLSSSASSVDSIVSSETPSPPSKRKVTTKIQGNAKKALLKWVQYTAGKQTGIEVKDFGKSW 
VAFHS VIHAI RPELVDLETVKGRSNRENLEDAFTI AETELGI PRLLDPEDVDVDKPDEKS IMT YVAQFLK 
HYPDIHNASTDGQEDDEILPGFPSFANSVQNFKREDRVIFKEMKVWIEQFERDLTRAQMVESNLQDKYQS 
FKHFRVQYEMKRKQI EHLIQPLHRDGKLSLDQALVKQSWDRVTSRLFDWHI QLDKSLPAPLGTIGAWLYR 
AEVALREE I TVQQVHEETANTI QRKLEQHK 

Hinnan SYNE1 Protein sequence - varl3 (public gi: 28192522) (seq id NO: 307) 

HI QLDKSLPAPLGTIGAWLYRAEVALREE I TVQQVHEETANTIQRKLEQHKRKCRTMMDLLQNTDAHKRA 
FHEIYRTRSWGIPVPPDQLEDMAERFHFVSPTSELHLMKME^ 

RES VEQLLQNYVS F I ENSKFFEQ YEVTYQI LKQTAEMYVKADGSVEEAENVMKFMNETTAQWRNLSVEVR 
SVRSMLEEVISNWDRYGNTVASIjQAWLEDAEKMLNQSENAKKDFFRNLPHW I QQHTAMNDAGNFLI ETCD 
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KLLIQDLEDIEQRVPVMDAQYKIITKTAHIilTKESPQEEGK^FATMSKLKEQLTKVKECYSPLLYESQQ 
LLIPLEELEKQMTSFYDSIiGKINEI ITVLEREAQSSALFKQKHQ 
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Unigene Name : TTC3 Unigene ID : Hs. 118174 Clone ID : GD_1105 
Human TTC3 mRNA sequence - varl (public gi: 2687860) (SEQ ID NO: 202) 

ATTAAAATAAACATCTTCTGGCCACTT 
CCTGTGTGGACGCCAACAATTCACGTGCTTCTGA^ 
GGAAGATATTGTGGATTTGGCAAAGAAAGTTGCTAATGATTC^^ 
GGTTGTAAAATAGAAAATAAAATCTTGGCAATGGAAGAAGCTC 

TAACAATTCTAACTAAATTAGGATCAATTGACAATTGTTGGCCTATGTTAAGTATTTTCT 

CAAGTACGACATAACTAAAATTGTAATGGAAGACTGCAATTTGCTTGAAGAACTTAAA 

ATGGATTGTATAGAGGAAGGAGAACTAATGAAAATGAAAGGAAATGAAGAGT^ 

ATATAGCTATTATCTATTACACCAGAGCCATTGAATATAGACCTGAAAACTACCTTC 

AGCTCTTTGTTTTCTTCGTACTGGACAGTTTAGAAATGCACT 

AAGAACACTTGGCCAAAGGGTCATTATCGTTATTGTGA 

CCCTGCAAGCA7VAGATAAAAGCTCAAAAACTCTGTAAAAATGACCCTGAGGGAA 
GCAGCATGTAAAGTTACAAAAACAAATAGAAGACCTACAAGGTCGAAGAGCAA 

GCCTTTTATGAAAACAGGGCCTACACACCTAGGAGTTTATCAGCACCTATATTTACTACTTCACTTAACT 

TTGTGGAGAAGGAAAGAGATTTCAGAAAAATTAATCACGAAATGGCCAACGGTGGTAATCAGAATCTAAA 

GGTGGCGGATGAGGCGTTGAAGGTAGATGATTGTGACTGTCATCCTGAATTTTC^CCAC(^TCAAGTCA^ 

CCTCCAAAACATAAAGGAAAACAAAAATCTCGAAACAATGAATGAGAAAAGTT 

TGACTTTACCAGCAGATTTGAAGAACATCTTGGAGAAAC^^ 

GGATTTTGCTAATATAATGAAAATGCTGAGAAGCTTAATTCCAGATGGCTATATGGCCTTATTGGAGCAG 
CGTTGCCGCAGCGCTGCACAGGCCTTTACAGAGTTGCTGAAC 

TGAACCTGGCCATGATTAACTATGTTTTGGTCGTCTATGGACTTGCCATTTCTCTCCTTGGAATAGGACA 

GCCTGAGGAATTATCTGAAGCCGAAAACCAGTTTAAGAGGATTATTGAACACTACCCCAGTGAGGGCCTT 

GATTGCTTGGCCTACTGTGGAATTGGAAAAGTATATTTGAAAAAAAACAGATTTCTAGAAGCrCT 

ACTTTGAGAAAGCAAGAACCTTGATTTATCGTCTTCCTGGAGTGTTAACTTGGCCCACGAGTAATGTGAT 

TATTGAAGAGTCTCAGCCACAAAAAATAAAGATGCTGT^ 

CCAGTGCCAGATGCCATTTGTTGCTATCAGAAGTGCCATGGATATT^ 

ATCC^GACTTTAAGGGTTTTATACGCATCAGCTGTTGCCAGTACTGTAAAATAGAATTTCACATGAATTG 
CTGGAAGAAGTTAAAAACTACAACCTTTAATGATAAAATTGACZAAGGATTTTCTACAAGGAAT 
ACCCCTGACTGTGAAGGTGTCATTTCTAAGATTATCATCTTCAGCAGTGGTGGTGAAGTTAAATGTGAAT 
TTGAACACAAGGTC^TAAAAGAAAAGGTTCCTCCAAGACCTATTCTC 

GAAACTAAGACTGAAAGAAGACAAAAAATTGAAGAGAAAGATCCAAAAAAAAGAAGCAAAAAAGTTAGCA 
CAAGAAAGAATGGAGGAGGACTTAAGAGAAAGTAATCGACCCAAAAATGAAGAGCAGAAAGAAACTGTAG 
ACAATGTTCAGCGTTGTCAGTTCCTTGATGACAGAATT^ 

TAAATCCGGCATACAGAATACAGCCACGCTTCTCAAAGAATTGCTTTCTTGGAAAGTTTTGAGCACAGAA 
GACTATACAACCTGTTTTTCTAGCAGAAATTTTCTAAATGAAGCAGTGGACTATGTTATTCGCCACTTGA 
TTCAAGAAAATAACAGAGTAAAGACAAGAATATTTCTGCATGTTTTGAGTGAGCTTAAAGAAGTGGAGCC 
GAAATTAGCCGCCTGGATCCAAAAACTTAATAGCTTTGGCTTAGATGCCACAGGAACTTTCTTTTCTCGT 
TATGGAGCATCTCTTAAACTGCTTGATTTTAGTATC^ 

AACTAGACTCTATAGAAGGAAAGCAACTTGATTATTTCTCTGAGCCAGCATCATTGAAGGAAGCCCGTTG 

TTTAATATGGCTGCTAGAAGAACACAGAGACAAGTTCC 

GATATAATGGACAGCCGCTGTACTGTGTTAAGGAAACAAGATAGTC 

AGGTGAAAAACAAAAGCAAGAAAAAGAAGCCAAAGGATTCAAAGCCTATGTTAGTTGGGTCTGGAACAAC 
TTCAGTAACTTCAAATAATGAGATCATCACTTCAAGTGAAGACCATAGCAATCGAAATTCAGATTCrG^ 
GGCCCATTTGCAGTGCCTGACCATCTTCGGCAAGATGT^ 

GTAACGAATATGTTGTCCGCAATAAGAAGCTATGGGACATGAACCCAAAACAAAAATGTTCAACTCTATA 

TGATTACTTCTCTCAGTTTTTGGAGGAACATGGTCCCTTGGACATGAGTAAGAAGATGTTCTCTGCAGAA 

TATGAGTTTTTCCCAGAAGAAACTCGACAGATACTAGAAAAAGCAGGAGGTTTAA^CCTTTTCTCTTGG 

GATGCCCTCGTTTTGTTGTGATTGAGAACTGTATTGC^ 

AAGGAAGAAGAAAAACATTAAAACAAAAGTAGAAGAAATTTCTVAAAGC^ 

CTACAACTGAATCCAGCTGCTAGGGAATTTAAACC^GATGTAAAGTCTAAACCAGTGTC^GATTCATCTT 

CAGCACCAGCTTTTGAAAATGTGAAACCCAAACCTGTGTCT 

TGTGAAGGCCAAACCAGTATCCTCACAATTCT^ 

TCTTCTAATTCTCCTAAACCAGGCTCTGAGGATGCAAATTACAAGCGAGTCTCCTGTAATTCCCCCAAAC 
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CGGTTCTTGAGGATGTGAAACCAAC 

TCCTTTCCAGAGATTTGATATCA.CCCAJGACACCGCCAGCA 

TACACCAGCATATATACACXICTTGGCCM 

TGCCX3TCTTTTGTAGCCAATGACAGAGCAGATAAAAATGCT 

TGCTGAGAATGTTGCTGGTCACCAGATTGCCTCTGAAAC^ 

GTAAAGTCACACTGCAGCACAGGTGATGCTCATACAGTCCTGA 

GTGGAAATTCTAACAACAAATGTGAAGTAATTCCAGAAAGCACCAGTGCAGT 

GCAGATGGTTGCCATACAGGTATCITGGAACATAATACACCAAGAAGT 

TTTGAGGAACGACAAGGGGAAATTTCACGGATTGAAAAGGAGCACCAAGTA^ 

AAGTGTATGAAAATTATGAGCAGATAAAACTTAAGGGCTTAGAAGAGACCAGGGACCTC 

GAAAAGGCACTTAGAAGAAAACAAGATCTCAAAGACGGAATT^ 

GAAATTAAAAAATGGCAACAGGAAAAAAAAGAAATCCAA 

AAAAGGTTTCAAATGCCAGTGAAATGTATACCCAGAAAA 

TCTGGATCAGTCCCTTGAAATCAGCAACACACTTACAAATGAGAAAATGAAAATAGAAGAGTATA 

AAAGGGAAAGAGGATTATGAAGAGAGTCATCAGAGAGCTGTGGCT 

GGAAGGAGAGTGAAGTGTATAAGCTACAGATCATGGAGTCACAAGCAG 

GCTGATTAGC CGTGATCCTGCAG CATATCCTGACATGGAGTCTGATATACGTTCATGGGAATTGTTTCTT 

TCTAATGTTACAAAAGTAATTGAGAAAGCAAAGTCTCAGT^ 

GTTCTCGGCTCAGTGAACTTTCTAAAGTGCAGATTTCTGA 

TCCCGAGTTACTCCCTGAGTOTTCAGGCGACX3ATGGC(^^GGGCTTGTGACTTCrrGCAAGCGACGTGACT 

GGAAACCACGCAGCACTTCACAGGGATCCTAGTGTGTT^ 

CTGCGCTGTTGCCAGGGCCACCCCCTGGTCAGCCT^ 

CCAGGCAGCTCTGTCAGAACGAAGCCCTGTGACTGATC 

CGTTCAAGCC^GTCTCCJU^AAAAGCCGTTCAATAGTATTA^ 

ACAACAGCACTGAGCTTGCTGGTTTTATTAAAAAAGTGCG 

GAGTATTGATGAAATTGTCGAAAGAGTGACAGAACACATTCT 

CCAGGAAAGGACAAGAGGACTTATGAGCCCAGCTCTGCCACCCCCGTGACCAGGTCCTCCCAGGGCTCAC 
CCTCGGTGGTTGTTGCACCATCACCCAAAACC^ 

ACTGGGTGCAAGTTCCTGTGAAATATGCCACGAGGTGTTGAAATCAAAAAACGTGCXy 

GGGCACAAGTATCACAAAGGGTGCTTTAAGCAGTGGCTTAAAGGGCAGAGCGCTTGCCCGGCCTGCCAGG 

GTCGTGATCTCCTGACAGAAGAGTCACCTTCTGGAAGAGGCTGGCCCAGTCAGAATCAGGAGCTGCCTTC 

CTGCTCTTCTAGGTAGTCACACTTCACTAAAGTGTCZATCCACCAGTGTGTTGAATCCGAAGAATGACAAT 

TTTCTACCACTGGTGTAAAAAACAAACATTTGAA^^ 

TGGAAATCGTTAATATCGCTGATATTAAGTAATTTCCCCACTCTGAGTGAATACTTTGATGATTGCCAAC 
AGTGGCTAATAAAATGACGGCTACCACACTCATGGGTCACTGGGGCTGCGCAGGGCTCTTTGAGGTGGGT 
GGCTTCTTTTGGAAAGTACTATGAACGTCTCGAAGCAGTATTCTAGTGATAAGAATTCTTAACATAGCCA 
AGCGCCCCACGTTTGTTCCCCACGTTTGTTCCCCTTTTCTGTTTGAATV^CCTGTTCTGGTAGCTCCACA 
AGAGAGATGATACTGACTTTTTAAATTTTTTAC?^AGAGTCTGTATTCCTGATATGCCTATATTTTTCCTC 
AAAGATTCTGCATTTTAAGGATGGGCATAAGCAAACTATATTTTAATAATTTATAGTTAATGTTAAAATA 
TTGGCTGATTTAGACCAAAAGATTCAAATCTCCTCTTTGTGAAATCCCATCTGCATTTGATTTTTTATTA 
TTTTATGTTCCCCCGTTAGATTGTTTTAAGTGTTTGCTTTTCATCTTTTATAGATGTAATCTGATTTTC^ 
AAAATCATTAACACTTTTTAATTAGTATCGACTAAGACTTTTTCCCCCTGGAATCGAGGCTGTGTGTCCG 
TCATCCCAGCCCCCGGTTGGAGCCTGCTCTTTGAACTCCGCTGCCTTCCTTAGCAGCTTCTGTCCTCTTC 
TGTGAGTCAGTCAGCGAGTGCTTGGGATCCGCATCCAGCCGTGCTGA 

AATGGCCACCACCATTCTCCTTCCCCACCCCACCACAAAAAGAGAAGCTGTGTCTTTAGACAACCCTGAG 
GTATCTGTGTTAGAATCGTTCTGTGTTTGATATTTGTGTAAAGTATGCATGCAGTCTTGTACTGTGACCT 
AAGAACAAAACTGTAACTGCATTAGAAACCATGAAAAAATTAGATA 
GTAAATATAGAACCATGAATTCTGGTCACATTC^ 

GAATGTGTTCTTTGTTCCACTGGAAACTTAGAGT ■ 
CCTTTGTTCAdTGTGAGTTCTGATGTCTTAGTGACrrAGTC 

GATTCTCCACGGTGGTCCCCAAAACACTGTCTGCATATCCATAAGAATTGAGCGCTATGGGTGTTAACGT 
GCATGAGGATCAGTTTGCAGCAGCAAGTACAAAAGGAGAAGAGGAACATCCGTTGAATGAGTGTGTTTTG 
TACATAACTTCAGATACTTGTGAACATGCCTTATATTTC 

Human TTC3 mRNA sequence - var2 (public gi: 1632765) (seq id no: 203) 

TACATTTGGAAAGTCTTACTGACATGCAGAAATAGTAC 

GGGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATCACAAGGTCAGGA 
GATCGAGACCATCGTGGCTAACATGGTGAAACCCTGTGCCTACTAAAJ^ 
ACTGGCATGTGOTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCCG 
GGTTGCAGTGAGCCAAGATCACGCCACTGCACTCCAGCCTGGGCGAC^ 

AAACAAAAGAATGCTATGCATAGGTACAATGTCGAAATGTGC^GAAATACTTCAGAAATATTAAAAGT^ 
GTTATTCCTGGGTAGTTGTGATTGTGACTCTGGGTGACTTTTTCCCTTTGTTTTATTTTTCTGTATTTTC 
CAAATTTCCTATAATGGACATATATTATATGGATTTTTTTTAATTAAAATTATCTTTTGACTAGATAATA 



Figure 36 part - 123 



168/202 



WO 2004/078130 



PCT/US2004/006308 



TACATGGAAAATTCACAAAGTACACATATGCAAGATATAATACTTTTACTA 
AAATACTTGTITTTCCTTTTAGAAATGCACTCGGTG^ 
GCCAAAGGGTCATTATCGTTATTGTGATGCTCTTTCTA^ 
AAGATAAAAGCTCAAAAACTCTX5TAAAAATGACCCTGAGGG 

AGTTACAAAAACAAATAGAAGACCTACAAGGTCGAACAGCAAATAAGGATCCAATTAAA 

AAACAGGGCCTACACACCTAGGAGTTTATCAGCACCTATATTTACTACTO 

.GAAAGAGATTTCAGAAAAATTAATCACGAAATGGCCAACGGTGGTAATCAGAATCTAAAGGTG 

AGGCGTTGAAGGTAGATGATTGTGACTGTCATCCTGAATTTTCACCA 

TAAAGGAAAACAAAAATCTCGAAACAATGAATCAGAAAAGTTCAGTTCT 

GCAGATTTGAAGAACATCITGGAGAAACAGTTTT 

ATATAATGAAAATGCTGAGAAGCTTAATTCAAGATGGCTATATGGCCTTATTGGAGCAG 

CGCTGCACAGGCCTTTACAGAGTTGCTGAACGGTTTAGATCCT 

ATGATTAACTATGTTTTGGTCGTCTATGGACTTG 

TATCTGAAGCCX3AAAACCAGTTTAAGAGGATTATTGAACACTACCCCAGTGA 

CTACTGTGGAATTGGAAAAGTGTATTTGAAAAAAAACAGAT^ 

GCAAGAAC CTTGATTTATCGTCTTC CTGGAGTGTTAACTTGGCCCACGAGTAATC 

CTCAGCCACAAAAAATAAAGATGCTGTTAGAGAAATTTGTTC 

TGCCATTTGTTGCTAT CAGAAGTGC CATGGAT ATTCTAAGATCCAGATATACATAACTTGATCCAGACTTT 
AAGGGTTTTATACGCATCAGCTGTTGCCAGTACTGTAAAATAGAATTTCACATGAATTGCTGGAAGAAGT 
TAAAAACTACAACCTTTAATGATAAAATTGACAAGGATTTTCTACAAG 

TGAAGGTGTCATTTCTAAGATTATCATCTTCAGCAGTGGTGGTGAAGTTAAATGTGAATTTGAACACAAG 

GTCATAAAAGAAAAGGTTCCTCCAAGACCTATTCTGAAACAGAAATGTTCTAGCCTAGAG/^AACTAAGAC 

TGAAAGAAGACAAAAAATTGAAGAGAAAGATCCAAAAAAAAGAAGCAAAAAAGTTAGCAC/^ 

GGAGGAGGACTTAAGAGAAAGTAATCGACCCAAAAATGAAGAGCAGAAAGAAACTGTAGACAATGTTCAG 

CGTTGTCAGTTCCTTGATGACAGAATTCTACAGTGTATAAAGCAGTATGCTGACAAGATTATkATCCGGCA 

TACAGAATACAGCCATGCTTCTCAAAGAATTGCTTTCTrTGGAAAG 

CTGTTTTTCTAGCAGAAATTTTCTAAATGAAGCAGTGGACTATGT^ 

AACAGAGTAAAGACAAGAATATTTCTGCATGTTTTGAGTGAGCTTAAAGAAGTGGAGCCCAAATTAGCCG 
CCTGGATCGAAAAACTTAATAGCTTTGGCTTAGATGCCACAGGAACTTTCT^ 

TCTTAAACTGCTITGATTTTAGTATCATGACTTTCCTCTGGAATGAGAAATATGGTCACAAACTAGACTCT 
ATAGAAGGAAAGCAACTTGATTATTTCTCTGAGCCAGCATCATTGAAGGAAGCCCGTTGTTTAATATGGC 
TGCTAGAkGAAGACAGAGACAAGTTCCCAGCATTGCAT^ 

CAGCCGCTGTACTGTGTTAAGGAAACAAGATAGTGGTGAAGCACCGTTTAGTTCAACCAAGGTGAAAAAC 

AAAAGCAAGAAAAAGAAGCCAAAGGATTCAAAGCCTATGTTAGTTGGGTCTGGAACAACTTCAG 

CAAATAATGAGATCATCACTTCAAGTGAAGACCATAGCAATCGAA 

AGTGCCl^CCATCTTCGGCAAGATGTAGAAGAATTCGAAGCTCTCTATGACCAACAC^TT^CGAATAT 
GTTGTCCGCAATAAGAAGCTATGGGACATGAACCC7^AAACAAAAATGTTCAACTCTATATGATTACTTCT 
CTCAGTTTTTGGAGGAACATGGTCCCTTGGACATGAGTAACAAG 

CCCAGAAGAAACTCGACAGATACTAGAAAAAGCAGGAGGTTTAAAACCTTTTCTCTTGGGATGCCCTCGT 

TTTGTTGTGATTGACAACTGTATTGCACTGAAGAAGGTTGGATCACGGCTCAAGAAAAAAAGGAAGAAGA 

AAAACATTAAAACAAAAGTAGAAGAAATXTCAAAAGCAGGGGAGTATGTACGAGTTAAACTACAACTGAA 

TCCAGCTGCTAGGGAATTTAAAC(^GATGTAAAGTCTAAACCAGTGTCAGATTCATCTTCAGCACCAGCT 

TTTGAAAATGTGAAACCCAAACCTGTGTCTGCAAATTCTCCCAAGCCAGCTTGTGAAGATGTGAAGGCCA 

AACCAGTATCCGACAATTCTTCTAGACAAGTTTCTGAGGATGGGCAACCCAAAGGGGTCTCTTCTAATTC 

TCCTAAACCAGGCTCTGAGGATGCAAATTACAAGCGAGTCTCCTGTAATTCCCCCAAACCGGTTCTTGAG 

GATGTGAAACCAACTTATTGGGCTCAATCCCATTTGGTCACAGGATACTGTACGTATCTTCCTTTCCAGA 

GATTTGATATCACCCAGACACCGCCAGCATACATAAACGTGTTACCAGGTTTGCCCCAGTACACCAGCAT 

ATATACACCCTTGGCCAGCCTTTCTCCTGAATATCAGCTACCAAGATCAGTACCAGTGGTGCCGTCTTTT 

GTAGCCAATGACAGAGCAGATAAAAATGCTGCTGCCTATTTTGAGGGTCATCATTTGAATGCTGAGAATG 

TTGCTGGTCACCAGATTGCCTCTGAAACACAGATCCTTGAGGGCTCTTTGGGAATATCTGTAAA.GTCACA 

CTGCAGCACAGGTGATGCTGATAGAGTCCTGAGTGAGTCTAACAGAAATGATGAGCACTGTGGAAAT 

AACAACAAATGTGAAGTAATTCCAGAAAGCACCAGTGCAGTAACAAA^ 

CCATACAGGTATCTTGGAACZATAATACACCAAGAAGTGAATACTGAGCCATATAATCCTTTTGAGGAACG 
ACAAGGGGAAATTTCACGGATTGAAAAGGAGCACCAAGTATTACAAGACCAACTTCAAGAAGTGTATGAA 
AATTATGAGCAGATAAAACTTAAGGGCTTAGAAGAGACCAGGGACCTGGAAGAGAAGTTGAAAAGGCACT 
TAGAAGAAAACAAGATCTCAAAGACGGAATTAGATTGGTTC 
' ATGGCAACAGGAAAAAAAAGAAATCC^GAAAGACTAAAATCACTGAAGAAGAAAATTAAAAAGGTTT 
AATGCC^GTGAAATGTATACCCAGAAZU\ATGATGGAAAGGAAAAGGAACATGAAT^ 

CCCTTGAAATCAGCAACACACTTACAAATGAGAAAATGAAAATAGAAGAGTATATAAAGAAAGGGAAAGA 
GGATTATGAAGAGAGTCATCAGAGAGCTGTGGCTGC^GAGGTATCCGTACTTGAAAACTGGAAGGAGAGT 
GAAGTGTATAAGCTACAGATGATGGAGTCACAAGCAGAAGCCTTTCTGAAGAAGCTGGGGCTGATTAGCC 
GTGATCCTGCAGCATATCCTGACATGGAGTCTGATATACGTTCATGGGAATTGTTTCTTTCTAATGTTAC 
AAAAGAAATTGAGAAAGCAAAGTOTCAGTTTGAAGAACAAATTAAGGCAATTAAAA^ 
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AGTGAACTTTCTAAAGTGCAGATTTCTGAGCTTTCATTTCCTGCCTC 

TCCCTGAGTCTTCAGGCCACGATGGCCAAGGGCTTGTGACT^ 

AGCACTTCACAGGGATCCTAGTGTGTTCTCTGCTG^ 

CCAGGGCCACCCCCTGGTCAGCCTGAAjKCACT 

TGTC^GAAOJAAGCCCTGTGGCTGATCGGAAGCAGCCTGTTCCT 

GTCTCCAAAAAAGCCGTTCAATAGTATTATTGAGCACCTGTCAGTGGTATTCCCATGTTA 
GAGCTTGCTGGTTTTATTAAAAAAGTGOSAAGCAA 

AAATTGTCCAAAGAGTGACAGAAGACATTCTAGATGAACAGAAAAAGAAAA^ 
CAAGAGGACITATGAGCCCAGCTCTGCCACC^ 

GTTGCACCATCACCCAAAACCAAGGGGCAGAAAGCAGAAGATGTCCCTGTGAGGATTC 

GTTCCTGTGAAATATGCCACGAGGTGTTC&AATCAAAAAACGT 

TCACAAAGGGTGCTTTAAG<^GTGGCTTAAAGGGCAGAGCGCT 

CTGACAGAAGAGTCACCTTCTGGAAGAGGCTGGCCCAGTCAGAATC 

GGTAGTCACACTTCACTAAAGTGTCATCCACCAGTGTGT^ 

GGTGTAAAAAACAAACATTTGAAGACCCTTGTGCATTGTO 

AATATCGCTGATATTAAGTAATTTCCCGACTCTGAGTGAATACTTTGATGATTGCGAACAG 
AAATGACGGCTACCACACTCATGGGTCACTGGGGCTGCGCAGGGCTCT 
GAAAGTACTATGAACGTCTCGAAGCAGTATTCTAGTGATAA 
TTTGTTCCCCACGTTTGTTCCCCTTTTCTC 

ACTGACTTTTTAAATTTTTTACAAGAGTCTGTATTCCTGATATGCCTATATTTTTCCT 

ATTTTAAGGATGGGCATAAGCAAACTATATTTTAATAATTTATAG 

AGACCAAAAGATTCAAATCTCCTCTTTGTGAAATC^ 

CCCGTTAGATTGTTTTAAGTGTraGCTTTT 

CACTTTTTAATTAGTATCGACTAAGACTTTTTCCCCCTGG^^ 

CCCGGTTGGAGCCTGCTCTTTGAACTCCGCTGCCTTCCTTAGCAGCTTCTGTCCTCTTCTGTGAGTCAGT 
CAGCGAGTGCTTGGGATCCGCATCCAGCCGTGCTGAGCACACAACAGGCTGTGTGTGGAAATGGCCACCA 
CC^TTCTCCTTCCCCACCCC^CCACA7yU^GAGAAGCTGTGTCTTTAGACAACCCTGAGGTATCTGTGTT 
ACAATCG1TCTGTGTTTGATATTTGTGTAAAGTATGG 

TGTAACTGCATTAGAAACCATGAAAAAATTAGATATTGTTTTGTGACTTTTAGACAGTGGTAAATA 
ACCATGAATTCTGGTCACATTCCATTTCTCTCCAACATGAAGG^ 

TTGTTCCACTGGAAACTTAGAGTCATGAGTTTATGAGCTGATTTGGTCACCTTCCTCTGCCTTTGTTCAC 

TGTGAGTTCTGATGTCTTAGTGACTTAGTTCTTAGAAGCTCACGCCTTAGTTTGAA^ 

GTGGTCCGCAAAACACTGTCTGCATATCCATAAGAATTGAGCGCTATGGGTGTTAACGTGCATGAGGATC 

AGTTTGCAGGAGCAAGTACAAAAGGAGAAGAGGAACATCCGTTGAATGAGTGTGTTTTGTACATAACTTC 

AGATACTTGTGAACATGCCTTATATTTGTCCAAGAACTGTCAGAATAAAGAACATTCTAAAATGAG 

Human TTC3 mRNA sequence - var3 (public gi: 1632763) (seq id no: 204) 

CTGAACTAGTTGCCAGTGATCTTGAAACGTGACAGTAACCAAGAGATAAATAGGTGACAATGACAGGAAA 

ATTAGATGTAGTAAAAGAGAGTGTTTGAGAGCAGAAGCTATGGCAACTAAAGACTGGATTTGAATCCTTC 

CTAGCTTGGTGACATGAGCAAATTACTTGATTTAAGTGAGCATTTTCCCATCTGTACAGTGGAGATAACG 

ATAATTGTGCCTGCTAAGT^GAATTGCTGTGAAGATTAGTGAAATAATGCATGTAAAACATTTGGTACAG 

TATGTGACACATAGTACAAATAGTTTGCTAGGAAGATTGTTATTATTCTTCACTTGTGATATTGTGAAGT 

TTTCATACAGCAAATTGGACATCATGAGATGGATTGATTAAATAAATAGATTTGAACTTCAAGGACTGGT 

AGTGTTCI^GCTTTGGAAAGAAGAAACTTGGTTTATCCTAATAATAGTAGGATAAT^ 

AGGTACAAGTAATAGTGTTTATGATGCGCTGGTGATGATAGGAAAAGAAAGCCATTATATGGGCAAGAGC 

TAGAAGTAATAAAATGGTGC^TTTTTCAGTGATGTTTGGCCTATGTAGCTATTCTCTGATAACTATAAAA 

ATCCTTATTATTGAAGATTCTTCAGGAAAAAAAAACCCTTAGTCTG 

CCCCCGATTGAAATACGTATTTTTAAAACATGGCTTTTGATAATC 

TAGCAGTGCTGATTGTGTATTGCAGTAGTTGTGAGAGCATTAGAAGCAGCAGTCGATAGGAGGATGGAAG 
GTCTGGATGCCGCCTTGGGGAGTTAGGAGATTGGCAGACTTACCCTGT^ 

GCCCAAGACAGAAACACACTGAGATGGATAGGAGAATATGAGCAGTTGATAGGAAAGTTCTCAGTGGAGT 

C^GGATTTAGGTTAGGCCAGGAGATTGAGAATATAACAGTCT 

AATGCAGTAAAAGCAGGTAGGGTACAAGTGCAGCAACAG^ 

TATTTGAGAGCTTACCATGTGCTAGGCACATAaVAAGATAAATAAGATGCCCrTG 

AAGGAGACATGTAAACAGGTTAACTTAGAGTAGAGATGGTGAATATGTGAACCTGAGGAAAGGAAGAAAT 
AGATTAAATTATCTGGAGAGAGAGGAAAAGTCAGCAGAATGGGGACGAGAATCTTTCGGAGCTCAGTGTT 
CTGATAGGAGTTATTTCCTTGGGCATAGGTTCCAAGTATTTTTCTAATATACGATAGAAGCCAGGAAAAC 
TTTCITCTGTTATCTCAAATGATTTAATT^ 

TGGACAATTTTGCTGAGGGAGATTTCACTGTGGCGGATTATGCCTTGTTAGAAGATTGCCCTCACGTGGA 
TGATTGTGTCTTTGCTGCTGAATTTATGAGCAA 

GTGGGTGTGCAATATAAAGATTATATCCAAAGTGAGAGGAATTTGGAATTTGACATCTGCAGTATATGGT 
GTAGTAAACCAATTTCTGTCCTGCAAGATTATTGCGATG 

GTTTC^C^TCAAAAC^GTTCCGTAATATCACGATTGCATCCCTGTGTGGACGCCAACAATTCACGTGCT 
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TCTGAGATAAATTTGAAGAAA 

TTGCTAATGATT.CATTCCTTATTGGAGGCTTATTGAGAATTC 

AATGGAAGAAGCTCTGAATTGGATAAAATATGCAGGCGATGTAACAATTCT 

GT^CAATTGTTGGCCTATGTTAAGTATTTTCTTTA 

GAAGAGTTTTCCAAAGAAAGATTTGATATAGCTATTATCTATTAC^ 

AAAACTACCTTCTTTATGGTAACCGAGCTCTTTGTTTTCT^ 

TGATGGAAAGAGAGCCACTATTCTGAAGAACACTTG^ 

TCTATGCTGGGGGAATATGACTGGGCCCTGC^GCAAACATAAAAGCT 

CTGAGGGAATCAAGGATCTAATTCAGCAGCATGTAAAGTTACAAAAACAAAT 

AACAGCAAATAAGGATCCAATTAAAGCCTTTTATGAAAACAGGGCCTACACACCTAGGA 

CCTATATTTACTACTTC^CTTAACTTTC^ 

CCAACGGTGGTAATCAGAATCTAAAGGTGGCGGATGAGGCGTTC 

TGAATTTTCACCACCATCAAGTCAGCCTCCAAAACATAAAGGA 

GAAAAGTTCAGTTCTAGTTCACCATTGACTTT^ 

CTAAATCTTCCAGAGCTGCACACCAGGATTTTGCTAATATAATGAAAATG 

TGGCTATATGGCCTTATTGGAGCAGCGTTGCCGCAGCGCTGCACAGGCCTTTACAGAGTTC 

TTAGATCCTCAAAAAATAAAGCAATTGAACCTGGCCATGATTAA 

CCATTTCTCTCCTTGGAATAGGACAGCCTGAGGAATTATCTGAAGCCGAAAACCAGTTTAAGAGGATTAT 
TGAACACTACCCCAGTGAGGGC CTTGATTGCTTGGC CTACTGTGGAATTGGAAAAGTGTATTTG A?UUU^A 
AACAGATTTCTAGAAGCTCTCAATCACTTTGAGAAAGCAA 

TAACTTGGCCCACGAGTAATGTGATTATTGAAGAGTCTCAGCCACAAAAAATAAAGATGCTGTTAGAGAA 
ATTTGTTGAAGAATGCAAGTTCCCTCCAGTGCCAGATGCCAT 

TCTAAGATCCAGATATACATAACTGATCCAGACTTTAAGGGTTTTATACGCATCAGCTGTTG 
GTAAAATAGAATTTCA.CATGAATTGCTGGAAGAAGTTAAAAACTACAACCTTT 

GGATTTTCTACAAGGAATATGTCTTACCCCTGACTGTGAAGGTGTCA.TTTCTAAGATTATCATCTTCAG 

AGTGGTGGTGAAGTTAAATGTGAATTTGAACACAAGGTCATAAAAGAAAAGGTTCCTC 

TGAAACAGAAATGTTCTAGCCTAGAGAAACTAAGACTGAAAGAAGACAAAAAATTGAAGAGAAA 

AAAAAAAGAAGCAAAAAAGTTAGCACAAGAAAGAATGGAGGAGGACTTAAGAGAAAGTAATCCACCCA^ 

AATGAAGAGCAGAAAGAAACTGTAGACAATGTTGAGCGTTGTCAGTTCCTTGATGACAG 

GTATAAAGCAGTATGCTGACAAGATTAAATCCGGCATACAGAAT^ 

TTCTTGGAAAGTTTTGAGCACAGAAGACTATACAACCTGTTTTTCTAGCA > GAAATTTTCTAAATGAAG(^ 
GTGGACTATGTTATTCGCCACTTGATTCAAGAAAATAACAGAGTAAAGACAAGAATATTTCTGCATGTOT 
TGAGTGAGCTTAAAGAAGTGGAGCCCAAATTAGCCGCCTGGATCCAAAAACTTAATAGCTTTGGCTTAGA 

TGCCACAGGAACTTTCTTTTCTCGTTATGGAGG^^ 

CTCTGGAATGAGA^TATGGTCACAAA.CTAGACTCTATAGAAGGAAAGCAACTTGATTATTTCTCTGAGC 
CAGCATCATTGAAGGAAGCCCGTTGTTTAATATGGCTGCTAGAAGAACACAGAGACAAGTTCCCAGCATT 
GCATAGTGCTTTAGATGAATTCTTTGATATAATGGACAGCCGCTGTACTGTGTTAAGGAAACAAGATAGT 
GGTGAAGCACCGTTTAGTTCAACCAAGGTGAAAAACAAAAGCAAGAAAAAGAAGCCAAAGGATTCAAAGC 
CTATGTTAGTTGGGTCTGGAACAACTTCAGTAACTTCAAATAATGAGATCATCACTTCAAGTGAAGACCA 
TAGC^TCGAAATTCAGATTCTGC^GGCCCATTTGCAGTGCCTGACCATCTTCGGCl^GATGTAGAAGAA 
TTCGAAGCTCTCTATGACCAACACAGTAACGAATATGTTGTCCGCAATAAGAAGCTATGGGACATGAACC 

CAAAACAAAAATGTTCAACTCTATATGATTACTTCTCT 

GAGTAACAAGATGTTCTCTGCAGAATATGAGTTTTTCCCAGAAGAAACTCGACAGATACTAGAAAAAGCA 
GGAGGTTTAAAACCTTTTCTCTTGGGATGCCCTCGTTTTGTTGTGATTGACAACTGTATTGCACTGAAGA 
AGGTTG C ATCACGGCTCAAGAAAAAAAGGAAGAAGAAAAACATTAAAACAAAA CAAA 
AGCAGGGGAGTATGTACGAGTTAAACTACAACTGAATCCAGCTGCTAGGGAATTTAAACCAGATGTAAAG 
TCTAAACCAGTGTCAGATTCATCTTCAGCACCAGCTTTTGAAAATGTGAAACCCA 

ATTCTCCCAAGCCAGCTTGTGAAGATGTGAAGGCGAAACCAGTATCCGACAATTCTTCTAGACAAGTTTC 
TGAGGATGGGCAACCCAAAGGGGTCTCTTCTAATTCTCCTAAACCA.GGCTCTGAGGATGCAAATTACAAG 

CGAGTCTCCTGTAATTCCCCCAAACCGGTTCCT 

TGGTCACAGGATACTGTACGTATCTTCCTTTCCAGAGATTTGATATCACCCAGACACCGCCAGCATAC^ 
AAACGTGTTACCAGGTTTGCCCCAGTAGkCCAGCATO 

CAGCTACCAAGATC^GTACCAGTGGTGCCGTCTTTTGTAGCCAATGACAGAGCAGATAAAAATG 
CCTATTTTGAGGGTCATCATTTGAATGCTGAGAATGT^ 

CCTTGAGGGCTCTTTGGGAATATCTGT7^AAGTCACACTGCAGCACAGGTGATGCTCATACAGTCCTGAGT 

GAGTCTAACAGAAATGATGAGC^CTGTGGAAATTCTAACAACAAATGTGAAGTAATTCCAGAAAGCACCA 

GTGCAGTAACAAACATTCCACACGTGCAGATGGTTGCCATACAGGTATCTTGGAACATAATACACCAAGA 

AGTCAATACTGAGCCATATAATCCTTTTGAGGAACGACAAGGGGAAATTTCACGGATTGAAAAGGA^ 

CAAGTATTACAAGACCAACTTCAAGAAGTGTATGAAAATTATGAGCAGATAAAACTTAAGGGCTTAGAAG 

AGACCAGGGACCTGGAAGAGAAGTTGAAAAGG(^CTTAGAA(^^ 

TTGGTTCCTTCAAGATTTGGAAAGAGAAATTAAAAAATGGGJ^CAGGA 

CTAAAATCACTGAAGAAGAAAATTAAAAAGGTTTGAAM 

GAAAGGAAAAGGAACATGAATTACATCTGGATGAGTCCCTTGA^ 



Figure 36 part - 126 



171/202 



WO 2004/078130 PCT/US2004/006308 

/^TGAAAATAGAAGAGTATATAAAGAAAGGGAAAGAGGATTA 

GCAGAG0TATCCX3TACTTGAAAACTGGAAGGAGAGTGAA 

CAGAAGCCTTTCTGAAGAAGCTGGGGCTGATT^^ 

TATACGTTCATGGGAATTGTTTCTTTCTAATGT^ 

GAACAAATTAAGGCAATTAAAAATGGTTCTCGGCTC^ 

CATTTCCTGCCTGTAACACGGTTCATCCCXjAGTTACTCCCTGAGT 

TGTGACTTCTGCAAGCGACXjTGACTGGAAACCACGCAGCACTTCA 

GGTGATTCCCCAGGGGAGGCTCCTTCTGCGCTGTTGCCAGGGCCACCCCCTGG 

AGCTGACAGGGCCAAAACGGGCTGGCCAGGC^GCTCT 

GCCTGTTCCTCCAGGATOTGCTGOSCGTTCAAGCC^ 

CACCTGTC^GTGGTATTCCCATGTTACAACA^ 

AAAACAAGAACTCACTCTCAGGATTGAGTATTGATGAAATTGTCCA^ 

TGAACAGAAAAAGAAAAAGCCAAACCCAGGAAAGGACAAGAGGACTTATGAGCCC^ 

GTGACCAGGTCCTCCCAGGGCTCACCCTCGGTGGTTGTTGiCACCATCACCCAAAACCAAGGGGCA 

CAGAAGATGTCCCTGTGAGGATTGCACTGGGTGCAAGTTCCTGTGAAATAT^ 

AAAAAACGTGCGTGTGCTCAAATGTGGGCACAAGTATCACAAAGGGTGCTTTA^ 

CAGAGCGCTTGCCCGGCCTGCCAGGGTCGTGATCTCCT 

CCAGTCAGAATCAGGAGCTGCCTTCCTGCTCTTCTAGGTAGTCA 

TGTGTTGAATCCGAAGAATGACAATTTTCTACCACTGGTGTAAAAAAGAAACATT 

ATTGTGTGTCACAAAGCTAAATAGATGGAAATCGTTT^ 

AGTGAATACTTTGATGATTGCCAACAGTGGCTAATAAAATGACGGCTACGACACTCATGGGTGACTGGGG 

CTGCGCAGGGCTCTTTGAGGTGGGTGGCTTCTTTTGGAAAGTACTATGAACGTCTCGAAGCAGTATTCTA 

GTGATAAGAATTCTTAACATAGCCAAGCX3CCCCACGTTTGTTCCCGACGTTTGTTCCCCTTTTCTGOT 

AAAAACCTGTTCTGGTAGCTCCACAAGAGAGATGATACT 

TCCTGATATGCCTATATTTTTCCTCAAAGATT^ 

ATAATTTATAGTTAATGTTAAAATATTGGCTGATTTAGACCAAAAGATTCAAATCTCCTCTTTGTGAAAT 
CCCATCTGCATTTGATTTTTTATTATTTTATGTTCCCCCGTTAGATTGTTTTAAGTGTTTGCTTTTCATC 
TTTTATAGATGTAATCTGATTTTCAAAAATCATTAACACTTTTTAATTAG 

CCCTGGAATCGAGGOTGTGTGTCCGTCATCCCAGCCCCCGGTTGGAGCCTGCTCTTTGAACTCCGCT 
TTCCTTAGCAGCTTCTGTCCTCTTCTGTGAG 

GAGCACACAACAGGCTGTGTGTGGAAATGGCCACCACCATTCTCCTTCCCCACCCCACCACAAAAAGAGA 

AGCTGTGTCTTTAGACAACCCTGAGGTATCTGTGTTAGAATCGTTCTGTGTTTGATATTTGTGTAAAGTA 

TGCATGCAGTCTTGTACTGTGACCTAAGAACAAAACTGTAACTGCATTAGAAACCATGAAAAAATTAGAT 

ATTGTTTTGTGACTTTTAGACAGTGGTAAATATAGAACCATGAATTCTGGTCAC^TTCCATTTCTCTCCA 

ACATGAAGGATCAAAAAATGTTTTTCAATGTGTTCTTTGTTCCACTGGAAACTTAGAGTCATGAGTTTAT 

GAGCTGATTTGGTCACCTTCCTCTGCCTTTGTTC^ 

GAAGCTCACGCCTTAGTTTGAAAGAGATTCTCCACGGTGGTCC^ 

AATTGAGCGCTATGGGTGTTAACGTGC^TGAGGATCAGTTTGCAGCAGCAAGTACAAAAGGAGAAGAGGA 
ACATCCX3TTGAATGAGTGTGTTTTGTACATAACTTCAGATACTTGTGAACATGCCTTATATTTGTCCAAC 
AACTGTCAGAATAAAGAACATTCTAAAATGAG 

Human TTC3 mRNA sequence - var4 (public gi: 163276D (seq id no: 205) 

CTGAACTAGTTGCCAGTGATCTTGAAACGTGACAGTAACCAAGAGATAAATAGGTGACAATGAC^ 

ATTAGATGTAGTAAAAGAGAGTGTTTGAGAGCAGAAGCTATGGCAACTAAAGACTGGATTTGAATCCTTC 

CTAGCTTGGTGACATGAGOVAATTACTTGATT^ 

ATAATTGTGCCTGCTAAGAAGAATTGCTGTGAAGATTAGTGAAATAATGCATGTAAAACATTTGGTACAG 
TATGTGACACATAGTACAAATAGTTTGCTAGGAAGATTGTTATTATTCTTCACTTGTGATATTGTGAAGT 
TTTCATACAGCAAATTGGACATCATGAGATGGATTGATTAAATAA^ 

AGTGTTCTTGCTTTGGAAAGAAGAAACTTGGTTTATCCTAATAATAGTAGGATAATAATGGTGAAGTGAT 
AGK3TACAAGTAATAGTGTTTATGATGCGCTGGTGATGATAGGAAAAGAAAGCCATTATATGGGCAAGAGC 
.TAGAAGTAATAAAATGGTGCATTTTTCAGTGATGTTTGGCCTATGTAGCTATTCTCTGATAACTATAAAA 
ATCCTTATTATTGAAGATTCTTC^GGAAAAAAAAACCCTTAGTCTGAAACTTTAGCACCAATCCCCCTTG 
CCCCCCATTGAAATACGTATTTTTAAAACATGGCTTTTGATAATGTGAGGGTTTTTTCCTTTTTGCGATT 
TAGCAGTGCTGATTGTGTATTGC^GTAGTTGTGAGAGCAT^ 

GTCTGGATGCCGCCTTGGGGAGTTAGGAGATTGGCAGACTTACCCTGTACCACTCTAGCCCTACTCCTTT 

GCCCAAGACAGAAACACACTGAGATGGATAGGAGAATATGAGCAGTTGATAGGAAAGT^ 

CAGGATTTAGGTTAGGCCAGGAGATTGAGAATATAACAGTTTGTGTATGATGAAATGGCATATTTCACAG 

TATTTGAGAGCTTACCATGTGCTAGGCACATACAAAGATAAATAAGATGCCCTTGATGATCCTCTATTTA 
AAGGAGACATGTAAACAGGTTAACTTAGAGTAGAGATGGTGAATATGTGAACCTGAGGAAAGGAAGAT^AT 
AGATTAAATTATCTGGAGAGAGAGGAAAAGTCAGCAGAATGGGGACGAGAATCTTTCGGAGCTCAGTGTT 
CTGATAGGAGTTATTTCCTTGGGCATAGGTTCCAAGTATTTTTCTAATATACGATAGAAGCCAGGAAAAC 
TTTCTTCTGTTATCTCAAATGATTTAATTACTGACTTGAGTTTGTGTTGTCTCCTTAGACTTGTGCACCA 
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TGGACAATTTTGCTGAGGGAGATTTCACTGTGGCGGAT^ 

TGATTGTGTCTTTGCTGCTGAATTTATGAGC 

GTGGGTGTGCAATATAAAGATTATATCOU^GTGA 

GTAGTAAACCAATTTCTGTCCTGCAAGATTATTGCGATGCCATO 

GTTTCAACATCAAAACAGTTCCGTAATATCACGATT^ 

TCTGAGATAAATTTGAAGAAACTACAACATCTTGAGTTGATGGAAGATATTO 
TTGCTAATGATTCATTCCTTATTGGAGGCT^^ 

AATGGAAGAAGCTCTGAATTGGATAAAATATGCAGGCGATGTAACA 

GACAATTGTTGGCCTATGTTAAGTATTTTCTTTACTGAATACAAGTACCACATAACTAAAATTC 

AAGACTGCAATTTGCTTGAAGAACTTAAAACTCAAAGTTGTATGGATTGTA 

GAAAATGAAAGGAAATGAAGAGTTTTCCAAAGAAAGATTTGATATAGCTATTATCTATTAC^ 

ATTGAATATAGACCTGAAAACTACCTTCI^TATGGTAACCGAGCTCTTTGTTOT 

TTAGAAATGCACTCGGTGATGGAAAGAGAGCCACTATTCTGAAGAACACTTGGCCAAAGGGTC^ 

TTATTGTGATGCTCTTTCTATGCTGGGGGAATATGACTGGGCCCTGCAAGCAAAC^ 

CTCTGTAAAAATGACCCTGAGGGAATCAAGGATCTAATT<^^ 

AAGACCTACAAGGTCGAACAGCAAATAAGGATCCAATTAAAGCCT 

TAGGAGTTTATCAGCACCTATATTTACTACTTCACTTAACTTTGTGGAGAAGGiUVAGAGATTTCAGAAAA 

ATTAATCACGAAATGGCCAACGGTGGTAATCAGAATCTAAAGGTGGC^^ 

ATTGTGACTGTCATCCTGAATTTTCACCACC^TG^ 

TCGAAACAATGAATCAGAAAAGTTCAGTTCTAGTTCA.CCATTGACTTTACCA 

TTGGAGAAACAGTTTTCTAAATCTTC(^GAGCTGCACAC(^GGATTTTGCTAATATAATGAAAATGCTGA 

GAAGCTTAATTCAAGATGGCTATATGGCCTTATTGGAGCAGCGTTGCCG^ 

AGAGTTGCTGAACGGTTTAGATCCTCAAAAAATAAA^ 

GTCGTCTATGGACTTGCCATTTCTCTCOTTGGAATAGGACAGCCTGAGGAATTATCTGAAGCCGAAAACC 
AGTTTAAGAGGATTATTGAACACTACCCCAGTGAGGGCCTTGATTGCTTGGCCTACTGTGGAATTGGAAA 
AGTGTATTTGAAAAAAAACAGATTTCTAGAAGCTCTCAATCACTTTGAGAAAGCAAGAACCTTGATTTAT 
CGTCTTCCTGGAGTGTTAACTTGGCCCACGAGTAATGT^ 

AGATGCTGTTAGAGAAATTTGrTGAAGAATGCAAGTTCCCTCCAGTGCCAGATGCC^TTTGTTGCT'AT^ 
GAAGTGCCATGGATATTCTAAGATCCAGATATACATAACTGATCCAGACTTTAAGGGTTTTATACGCATC 
AGCTGTTGCCAGTACTGTAAAATAGAATTTC^CATGAATTGCTGGAAGAAGTTAAAAACTACAACCTTTA 
AT^ATAAAATTGAC^GGATTTTCTAC^AGGAATATGTCTTACCCCTGACTGTGAAGGTGTCATTTCTAA 
GATTATCATCTTCAGCAGTGGTGGTGAAGTTAAATGTGAATTTGAACACAAGGTCATAAAAGAAAAGGTT 
CCTCCAAGACCTATTCTGAAACAGAAATGTTCTAGCCTAGAGAAACTAAGAC 

TGAAGAGAAAGATCCAAAAAAAAGAAGCAAAAAAGTTAGCACAAGAAAGAATGGAQGAGGACTTAAGAGA 
AAGTAATCCACCCAAAAATGAAGAGCAGAAAGAAACTGTAGACAATGTTCZAGCGTTGTCAGTTCCTTGA^ 
GACAGAATTCTACAGTGTATAAAGCAGTATGCTGACAAGATTAAATCCGGCATACAGAATAGAGCCATGC 
TTCTCAAAGAATTGCTTTCTTGGAAAGTTTTGAGCACAGAAGACT^ 

TTTTCTAAATGAAGCAGTGGACTATGTTATTCGCCACTTGATTCAAGAAAATAACAGAGTAAAGACAA 
ATATTTCTGC^TGTTTTGAGTGAGCTTAAAGAAGTGGAGCCCAAATTAGCCGCCTGGATCCAAAAACTTA 
ATAGCTTTGGCTTAGATGCCACAGGAACTTTCTTTTCTCGTTATGGAGCATCTCTTAAACTGCTTGATTT 
TAGTATCATGACTTTCCTCTGGAATGAGAAATATGGTCACAA^ 

GATTATTTCTCTGAGCCAGCATCATTGAAGGAAGCCCGTTGTTTAATATGGCTGCTAGAAGAACACAGAG 

AC^GTTCCCAGCATTGCATAGTGCTTTAGATGAATTCTTTGATATAATGGAGAGCCGCTGTACTGTGTT 

AAGGAAACAAGATAGTGGTGAAGCACCGTTTAGTTCAACCAAGGTGAAAAACAAAAGCAAGA 

CCAAAGGATTCAAAGCCTATGTTAGTTGGGTCTGGAACAACTTCAGTAACTTCAAATAATGAG 

CTTCAAGTGAAGACCATAGCAATCGAAATTCAGATTCTGCAGGCCCATTTGCAGTGCCTGACCATCTTCG 

GCAAGATGTAGAAGAATTCGAAGCTCTCTATGACCAACACAGTiU^CG 

CTATGGGACATGAACCCAAAACAAAAATGTTCAACTCTATATGATTACTO 

ATGGTCCCTTGGAC^TGAGTAACAAGATGTTCTCTGCAGAATATGAGTTTTTCCCAGAAGAAACTCGAC^ 

GATACTAGAAAAAGCAGGAGGTTTAAAACCTTTTGTCTTGGGATGCCCTCGTTTTGTTGTGATTGACAAC 

TGTATTGCACTGAAGAAGGTTGCATCACGGCTCAAGAAAAAAAGGAAGAAGAAAAAGATTAAAAC^ 

TAGAAGAAATTTCAAAAGCAGGGGAGTATGTACGAGTTAAACTACAACTGAATCCAGCTGCTAGGGAATT 

TAAACCAGATGTAAAGTCTAAACCAGTGTCAGATTCATCTTCAGCACCAGCTTTTGAAAATGTGAAACCC 

AAACCTGTGTCTGCAAATTCTCCC7VAGCCAGCTTGTGAAGATGTGAAGGC 

CTTCTAGACAAGTTTCTGAGGATGGGCAACCCAAAGGGGTCTCTTCTAATTCTCCTAAACCAGGCTCT 

GGATGCAAATTACAAGCGAGTCTCCTGTAATTCCCCC^AACCGGTTCTTGAGGATGTGAAACCAACTTAT 

TGGGCTCAATCCCATTTGGTCAC^GGATACTGTACGTA^ 

CACCGCCAGCATAC2VrAAACGTGTTACC^GGTTTGCCCCAGTA(^ 

CCTTTCTCCTGAATATCAGCTACCAAGATCAGTACC^^ 

GATA7UUWVTGCTGCTGCCTATTTTGAGGGTCATCATTTGAATGCTGAGAATGTTGCTGGTCACCAGATTG 
CCTCTGAAAC^CAGATCCTTGAGGGCTCrTTGGGAATATCTGTAAAGTCACACTGCAGCACAGGTO 
TCATACAGTCCTGAGTGAGTCTAACAGAAATGATGAGCACTGTGGAAATTCTAACAACAAATGTGAAGTA 
ATTCCAGAAAGCACCAGTGCAGTAACAAACATTCCACACGTGCAGATGGTTGCCATACAGGTATCTTGGA 
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ACATAATACACCAAGAAGTCAAT^^ 

GATTGAAAAGGAGCACCAAGTATTACAAGACCAACTTCAAGAAGTGTATC 

CTTAAGGGCTTAGAAGAGACCAGGGACCTGGAAGAGAAGCT 

CAAAGACGGAATTAGATTGGTTCCTTCAAGATTT^ 

AGAAATCCAAGAAAGACTAAAATCACTGAAGAAGAAAATTAAAAAGGTTTCAAATC 
ACCGAGAAAAATGATGGAAAGGAAAAGGAACATGAATTACAT 

CACTTACAAATGAGAAAATGAAAATAGAAGAGTATATAAAGAAAGGGAAAGAGGACT 
TCAGAGAGCTGTGGCTGCAGAGGTATCCGTACTT 

ATCATGGAGTCACAAGCAGAAGCCTTTCTGAAGAAGCTGGGGCTGAra 

CTGACATGGAGTCTGATATACGTTCATGGGAATTGTTTCTTTCT 

AAAGTCTCAGTTTGAAGAACAAATTAAGGCAATTAAAAATGGTTCTC 

CAGATTTCTGAGCTTTGATTTCCTGCCTGTAACACGGTTCATCCCGAGTTACT 

ACGATGGCCAAGGGCTTGTGACTTCTGCAAGCGACGTGACTC 

TAGTGTGTTCTCTGCTGGTGATTCCCCAGGGGAGGCT 

CAGCCTGAAGCCACTCAGCTGACAGGGCCAAAACGGGCT 

TGGCTGATCGGAAGCAGCCTGTTCCTCCAGGACGTGCTGCGCGTTCAAGCCAGTCTCCAAAAAA 
CAATAGTATTATTGAGCACCTGTCAGTGGTATTCCCATX3 

AAAAAAGTGCGAAGCAAAAACAAGAACTC^CTCTCAGGATTGAGTATTGATGAAATTCT ■ 

CAGAAC^CATTCTAGATGAACAGAAAAAGAAAAAGCCAAACCCAGGAAAGGACAAGAGGACTT 

CAGCTCTGCCACCCCCGTGACCAGGTCCTCCCAGGGCTCACCCTCGGTGGTTGTTGCACCA 

ACCAAGGGGCAGAAAGCAGAAGATGTCCCTGTGAGGATTGCACTGGGTGCAAGTTCCTGTGAAATATGCC 

ACGAGGTGTTGAAATCAAAAAACGTGCGTGTGCTCAAATGT 

GCAGTGGCTTAAA'GGGCAGAGCGCTTGCCCGGCCTGCCAGGGTCGTGATCTCCTGACAGAAGAGTCACCT 

TCTGGAAGAGGCTGGCCCAGTCAGAATCAGGAGCTGCCTTCCTGCTCTTCTAGGTAGTCACACTTCACTA 

AAGTGTCATCCACCAGTGTGTTGAATCCGAAGAATGACAATTTTCTACCACTGGTGTA 

TTGAAGACCCTTGTGCATTGTGTGTCACAAAGCTAAATACATGGAAATCGTTAATATCGCTGATATTAAG 

TAATTTCCCCACTCTGAGTGAATACTTTGATGATTGCCAACAGTGGCT 

TCATGGGTCACTGGGGCTGCGCAGGGCTCTTTGAGGTGGGTGGCTTCTTTTGGAAAGTACTATGAACGTC 
TCGAAGCAGTATTCTAGTGATAAGAATTCTTAACATAGCCAAGCGCCCCACGTTTGTTCCCCACGTTTGT 
TCCCCTTTTCTGTTTGAAAAACCTGTTCTGGTAGCTCCACAAGAGAGATGATACTGACTTTTTAAATTTT 
TTAG^GAGTCTGTATTCCTGATATGCCTATATTT^ 

AGCAAACTATATTTTAATAATTTATAGTTAATGTTAAAATATTGGCTGATTTAGACCAAAAGATTCAAAT 
CTCCTCTTTGTGAAATCCCATCTG(^TTTGATTTTTTATTATTTTATGTTCCCCCGTTAGATTGTTTTAA 
GTGTTTGCTTTTCATCTTTTATAGATGTAATCTGATTTTC^ 

GACTAAGACTTTTTCCCCCTGGAATCGAGGCTGTGTGTCCGTCATCCCAGCCCCCGGTTGGAGCCTGCTC 
TTTGAACTCCGCTGCCTTCCTTAGCAGCTTCTGTCCTCTTCTGTGAGTCAGTCAGCGAGTGCTTGGGATC 
CGCATGCAGCCGTGCTGAGCACAGAACAGGCTGTGTGTGGAAATGGCCACCACCATTCTCCTTCCCCACC 
CCACCAC^AAAAGAGAAGCTGTGTCTTTAGAGAACCCTGAGGTATCTGTGTTAC/ATCGTTCTGTGTTTG 
ATATTTGTGTAAAGTATGCATGCAGTCTTGTACTGTGACCTAAGAACAAAACTGTAACTGCATTAGAAAC 
CATGAAAAAATTAGATATTGTTTTGTGACTTTTAGACAGTGGTAAATATAGAACCATGAATTCTGGTCAC 
ATTCCATTTCTCTCCAACATGAAGGATCAAAAAATC 

AGAGTCATGAGTTTATGAGCTGATTTGGTCACCTTCCTCTGCCTTTGTTCACTGTGAGTTCTGATGTCTT 
AGTGACTTAGTTCTTAGAAGCTCACGCCTTAGTTTGAAACAGATTCTCCACGGTGGTCCCCAAAACACTG 
TCTGCATATCCATAAGAATTGAGCGCTATGGGTGTTAACGTGCATGAGGATCAGTTTGCAGCAGCAAGTA 
CAAAAGGAGAAGAGGAACATCCGTTGAATGAGTGTGTTTTGTACATAACTTCAGATACTTGTGAACATGC 
CTTATATTTGTCCAACAACTGTCAGAATAAAGAACATTCTAAAATGAG 

Human TTC3 mKNA sequence - var5 (public gi: 2969902) (seq id NO; 206) 

ATATAATGTGAGGGTTTTTTCCTTTTTGCGATTTAGCAGTGCTGATTGTGTATTGCAGTAGTTGTGAGAG 
CATTAGAAGCAGCAGTCGATAGGAGGATGGAAGGTCTGGATGCCGCCTTGGGGAGTTAGGAGATTGGCAG 
ACTTACCCTGTACCACTCTAGCCCTACTCCTTTGCCCAAGACAGAAACACACTGAGATGGATAGGAGAAT 
GTGAGCAGTTGATAGGAi^GTTCTCAGTGGAGTGAGGATTT^ 

AGTTTGTGTATGATGAAATGGCATATTTCACAGAATGCAGTAAAAGCAGTGTAGGGTAAACCAAGTGCAG 

TCAACAGCAAGATGTATTTTCGATGCCAGTTCAACATAAACATCTTATTGTGAGCAGTCTTACCATGTGC 

TAGGCAACTATACAAAACAGATAAGATAAGATGCACGATTGACGATCCTCTATGTAAAGGACGACATGTA 

CAATTCACGTGCTTAACTGAGAGTAGAGATTTGAAGAAACTACAACATCTTGAGTTGATGGAAGAT^ 

TGGATTTGGCAAGGAAAGTTGCTAATGATTCATTCCTTATTGGAGGCTTATTGAGAATTGGTTGTAAAAT 

AGAAAATAAAATCTTGGCAATGGAAGAAGCTCTGAATTGGATAAAATATGCAGGCGATGTAACAATTCTA 

ACTAAATTAGGATCAATTGACAATTGTTGGCCTATGTTAAGTATTTTCTTTACTGAATAC^GTACCACA 

TAACTAAAATTGTAATGGAAGACTGCAATTTGCTTGAAGAACTTAAAACTCAAAGTTGTATGGATTGTAT 

AGAGGAAGGAGGACTAATGAAAATGAAAGGAAATGAAGAGTTTTCCAAAGAAAGATTTGATATAGCTATT 

ATCTATTACACCAGAGCC^TTGAATATAGACOTGAAAACTACOTTCTTTATGGTAACCGAGCTCTTTGTT 

TTCCTCGTACTGGACAGTTTAGAAATGCACTCGGTGATGGAAAGAGAGCCACTATTCTGAAGAAGACTTG 
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GCCAAAGGGTCATTATCGTTATTGTG^ 

AACATAAAAGCTCAAAAACTCTGTAAAAAT^ 

AGTTACAAAAACAAATAGAAGACCTACAAGGTCGAACAGG^ 

AAACAGGGCCTACACACCTAGGAGTTTATCAGCACCTATATTTACT 

GAAAGAGATTTCAGAAAAATTAATCACGAAATGGCCA^^ 

AGGCGTTGAAGGTAGATGATTGTGACTGTCATCCTGAATTTTCACCACCATCAAGTCAGCCTCCA 

TAAAGGAAAACAAAAATCTCGAAACAATGAATCAGAAAAGTTCAGTTC^^ 

GCAGATTTGAAGAACATCTTGGAGAAACAGTTTTCT 

ATATAATGAAAATGCTGAGAAGCTTAATTCAAGATGGCTATATGGCCTTATTGGAGCAG 

CGCTGCACAGGCCTTTACAGAGTTGCTGAACGGTTTAGATCCTCAAAAAATAAAGCAATT^ 

ATGATTAACTATGTTTTGGTCGTCTATGGACTTGC CATTTCTCTCCTTGGAATAGGACAG CCTGAGGAAT 

TATCTGAAGCCX3AAAACCAGTTTAAGAGGATTATTGAACACTACCCCAGTGAGGGCOT 

CTACTGTGGAATTGGAAAAGTGTATTTGAAAAAAAACAGATTTCTAGAAGCT 

GCAAGAACCTTGATTTATCGTCTTCCTGGAGTGTTAACTTGGCCA 

Human TTC3 mRNA sequence - var6 (public gi: 1304131) (seq id no: 207) 

CCTAAAGAAAAGTATTAAGTAAATAGCAGTAC^GATGGCAAATGGATTGCACAATATATCCTCTGGATCC 
ATAGTGACCCTGCAGAGATAAACCTGTGATGGTCAAACAATGTGAAAACTGCTGTCAGAGA 
GGTGCTCTTGTTTACAGAGAAGAGGTGCAAAAATGAACTTGA 
CTGAAATTGAGTATTAAGAACTAATAGACA^ 

CAGTTGGGTATGACTGGCTTCTAGGTTGTGTGTTGTGGAGTGACTGGGGATAAAAGC^ 

CAAAAGGTCTTCTATGCTTATATTAGGGAAGTTGGACTTTATTCTCAAGCTGAAGGGAAGCTGTTGCATG 

GTTTTAAGCAGTAAAGTGATATGATCAGAGTTTTAGAGGATGCCAAGATTGAAGGCAAGTCTGACCAGTT 

AGGAGACTGCTTGTTAAATTAGTTCAGAGGAGAAACAGTGAAGGCAGTGGCACTGGGCATGAAGAAGTAT 

ATGTGTGCTAATTTTAGATTTCTTAGGGAAGCAGAAATGACAAGAGTTAGTGGTCCATTGGACAGAAATA 

TTGAAGGAGACTGGGGAGTCTAGGTTGACTCCCAGGGTTTAGGTTTGGGCAGTAAAATGACATGTAGAAC 

AATTAACTGATAAACAGCATACAGAAAGAGGAAAGAACTTCATTT 

TGTTTGTTTTTGAACTTCCTGATTCAGAGGGGCTTGTGGGACATCTTGGTTAAGATCCTGTAGTAGTTCT 
AGTAGGGTCTAGAAGTCAAGAGATACAACCCCGCCTGGAAGGATTTGGGAGTCTTCAGCATTTGGAATTT 
TGGAAGCGATTGTTTACTGCAGTGCATATGAGATAATTTAAACTGGTACATAGATAAACACTTGAAAAAA 
TTTTAATAGATAGGAATTTAATGTGTATGAGAAACATAACTTGCACATCTAACCTTTGATAATCATGGAC 
ATTATCACTTAGGCCAAGTGAGCTCAATAAAAGGGAAATATTAACTAAAAATATATGTGATACATGGAAA 
TGGCAAAAATC^CGGTGAAGCTATGCAAACGATGC^G^ 

GTGGATAAATTCACCCAAGGAGAGAGTAAGAGTGAGAAGAAAAGAGAGTTGACATTGGGTGGCGGGGAAT 

GGAGAAGAAGAACCCATGAAGGAAACTGAGGAAGAGCAGCCAGACGAATAGGAGGAAAACCAGGAGAAGA 

TGGTCTTTGGAGTCCAAATGAAGAGTTOTGAGGAGGAAGTGGTCCAGAGCGTCAAACTTGTGCA 

ACAATTTTGCTGAGGGAGATTTCACTGTGGCGGATTATGCCTTGTTAGAAGATTGCCCTCACGTGGATGA 

TTGTGTCTTTGCTGCTGAATTTATGAGCAATGATTATGTTCGTGTGACTCAGCTTTACTGTGATGGGGTG j 

GGTGTGCAATATAAAGATTATATCCAAAGTGAGAGGAATTTGGAATTTGACATCTGCAGTATATGGTGTA 

GTAAACCAATTTCTGTCCTGCAAGATTATTGCGATGCCATTAAAAT^ 

TCAACATCAAAACAGTTCCGTAATATC^CGATTC^ 

GAGATAAATTTGAAGAAACTACAACATCTTGAGTTGATGGAAGATATTGTGGATTTGGCAAAGAAAGTTG 
CTAATGATTCATTCCTTATTGGAGGCTTATTGAGAATTGGTTGTAAAATAGAAAATAAAATCTTGGCAAT 
GGAAGAAGCTCTGAATTGGATAAAATATGCAGGCGATGTAACAATTCTAACTAAATTAGGATCAATTGAC 
AATTGTTGGCCTATGTTAAGTATTTTCTTTACTGAATACAAGTACCACATAACTAAAATTGTAATGGAAG 
ACTGCAATTTGCTTGAAGAACTTAAT^ACTCAAAGTTGTATGGATTGTATAGAGGAAGGAGAACTAATGAA 
AATGAAAGGAAATGAAGAGTTTTCCAAAGAAAGATTTGATAT^ 

GAATATAGACCTGAAAACTACCTTCTTTATGGTAACCGAGCTCTTTGTTTTCTTCGTACTGGACAGTTTA 
GAAATGCACTCGGTGATGGAAAGAGAGCCACTATTCTGAAGAACACTTGGCCAAAGGGTCATTATCGTTA 

TTGTGATGCTCTTTCTATGCTGGGGGAATATGACTGGGC^^ 

TGTAAAAATGACCCTGAGGGAATCAAGGATCTAATTCAGCAGCATGTAAAGTTACAAAAACAAATAGAAG 
ACCT ACAAGGT CGAACAGCAAATAAGGATCCAATTAAAGCCTTTTATGAAAAC AGGGC CTACACAC CTAG 
GAGTTTATCAGC^CCTATATTTACTACT 

AATGACGAAATGGCCAACGGTGGTAATC^GAATCTAAAGGTGGCGGATGAGGCGTTGAAGGTAGATGATT 

GTGACTGTCATCCTGAATTTTCACCACCATCAAGTCAGCCTCCAAAACATAAAGGAAAACAA 

AAACAATGAATCAGAAAAGTTCAGTTCTAGTTCACCATTGACTTTACCAGCAGATTTGAAGAACATCTTG 

GAGAAACAGTTTT CTAAATCTTCCAGAGCTGCACACCAGGATT TTGCTAATATAATGAAAATGCTGAGAA 

GCTTAATTC^AGATGGCTATATGGCCTTATTGGAGCAGCGTTGCCGC^GCGCTGCAC^GGCCTTTACAGA 

GTTGCTGAACGGTTTAGATCCTCAAAAAATAAAGCAATTGAACC 

GTCTATGGACTTGCCATTTCTCTCCTTGGAATAGGACAGCCTGAGGAATTATCTGAAGCCGAAAACCAGT 
TTAAGAGGATTATTGAACACTACCCCAGTGAGGGCCTTGATTGCTTGGCCTACTGTGGAATTGGAAAAGT 
GTATTTGAAAAAAAACAGATTTCTAGAAGCTCTCAATC^CTTTGAGAAAGCAAGAACCTTGATTTA 
CTTCCTGGAGTGTTAACTTGGCCCACGAGTAATGTGATTATTGAAGAGTCTCAGCCACAAAAAATAAAGA 
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TGCTGTTAGAGAAATTTGTTGAAGAATGCAAGTTCCCT 

GTGCCATGGATATTCTAAGATCCAGATATACATAACTGATCCAGAC^ 

TGTTGCCAGTACTGTAAAATAGAATTTCACATGAATTGCTGGAAGA 

ATAAAATTGACAAGGATTTTCTAGAAGGAATATO 

TATCATCTTCAGCAGTGGTGGTGAAGTTAAATGTGJ^ 

CCAAGACCTATTCTGAAACAGAAATGTTCTAGCCT^ 

AGAGAAAGATCCAAAAAAAAGAAGCAAAAAAGTTAGCACAAGAAAGAATGGAGGAGGACTTAAG^ 
TAATCCACCCAAAAATGAAGAGCAGAAAGAAACTGTAGACAATGTTCAGCGT^ 
AGAATTCTACAGTGTATAAAGCAGTATGCTGACAAGATTAAATCCGGCAT 
TCAAAGAATTGCTTTCTTGGAAAGTTTTGAGCACAGAAGA 

TCTAAATGAAGCAGTGGACTATGTTATTCGCCACTTGATTCAAGAAAATAACAGA 

TTTCTGCATGTTTTGAGTGAGCTTAAAGAAGTGGAGCCCA 

GCTTTGGCTTAGATGCCACAGGAACTTTCTTTTCTC 

TATCATGACTTTCCTCTGGAATGAGAAATATGGTCACAAACTAGACTCT^ 
TATTTCTCTGAGCCAGCATC^TTGAAGGAAGCCCGTTGTTTAATATGGCTGCT^ 

AGTTCCCAGGATTGCATAGTGCTTTAGATGAATTCTTTGATATAATGGACAGCCGCTGTACTGTGTTAAG 

GAAACAAGATAGTGGTGAAGGACCGTTTAGTTCAACCAAGGT<^^ 

AAGGATTCAAAGCCTATGTTAGTTGGGTCTGGAACAAC 

CAAGTGAAGACCATAGGAATCGAAATTCAGATTCTGCAGGCCCATTTGCAGTGCCTGA 

AGATGTAGAAGAATTCGAAGCTCTCTATGACCAACACAGTAACGAATATGTTGTCCGCAATAAGAAGCTA 

TGGGACATGAACCCAAAACAAAAATGTTCAACTCTATATGATT^ 

GTCCCTTGGACATGAGTAACAAGATGTTCTCTGCAGAATATGAGTTTTTCCCAGAAGAAACTCGAt^ 

ACTAGAAAAAGCAGGAGGTTTAAAACCTTTTCTCTTGGGATGCCCTCGTTTTGTTGTGATTGACAACTGT 

ATTGCACTGAAGAAGGTTGCATCACGGCTCAAGAAAAAAAGGAAGAAGAAAAACATTAAAACAAAAGTAG 

AAGAAATTTCAAAAGCAGGGGAGTATGTACGAGTTAAACTACAACTGAATCCAGCTGCTAGGGAATTTAA 

ACGAGATGTAAAGTCTAAACCAGTGTCAGATTCATC 

CCTGTGTCTGCAAATTCTCCCAAGCCAGCTTGTGAAGATGTGAAGGCCAAACCAGTATCCGACAATTCTT 
CTAGACAAGTTTCTGAGGATGGGCAACCCAAAGGGGTCTCTTCTAATTCTCCTAAACCAGGCTCTGAG^ 
TGCAAATTACAAGQ3AGTCTCCTGTAATTCCCCCAAACCGGTTCTTGAGGATGTGAAA.CCA 
GCTCAATCCCATTTGGTCACAGGATACTGTACGTATCT^^ 

CGCCAGCATACATAAACGTGTTACCAGGTTTGCCCCAGTACACCAGCATATATACACCCTT 

TTCTCCTGAATATCAGCTACCAAGATCAGTACCAGTGGTGCCGTCTTTTGTAGCCAATGACAGAGCAGAT 

AAAAATGCTGCTGCCTATTTTGAGGGTCATCATTTGAATGCTGAGAATGTTGCTCGTCACCAGATTGCCT 

CTGAAACACAGATCCTTGAGGGCTCTTTGGGAATATCTC 

TACAGTCCTGAGTGAGTCTAACAGAAATGATGAGCACTGTGGAAATTC 

CCAGAAAGCACCAGTGCAGTAACAAACATTCCACACGTGCAGATGGTTGCCATACAGGTATCTTGGAACA 

TAATACACCAAGAAGTCAATACTGAGC CATATAATC CTTTTGAGGAACGAC AAGG GG AAATTTCACGGAT 

TGAAAAGGAGCACCAAGTATTACAAGACCAACTTGAAGAAGTGTATGAAAATTATGAGCAGATAAAACTT 

AAGGGCTTAGAAGAGACCAGGGACCTGGAAGAGAAGTTGAAAAGGCACTTAGAAGAAAACAAGATCTCAA 

AGAOSGAATTAGATTGGTTCCTTCAAGATTTGGAAAGAGAAATTAAAAAATGGCAACAGGAAAAAAAA^ 

AATCCAAGAAAGACTAAAATCACTGAAGAAGAAAATTAAAAAGGTTTCAAATGCCAGTGA 

CAGAAAAATGATGGAAAGGAAAAGGAACATGAATTACATCTGGATCAGTCCCTTGAAATCAGCAACACAC 

TTACAAATGAGAAAATGAAAATAGAAGAGTATATAAAGAAAGGGAAAGAGGATTATGAAGAGAGTCATCA 

GAGAGCTGTGGCTGCAGAGGTATCCGTACTTGAAAACTGGAAGGAGAGTGAAGTGTATAAGCTACAGATC 

ATGGAGTCACAAGCAGAAGCCTTTCTGAAGAAGCTGGGGCTGATTAGCCGTGATCCTGCAGCATATCCTG 

ACATGGAGTCTGATATACGTT(^TGGGAATTGTTTCTTTCTAATGTTACAAAAGAAA.TTGAGAAAGCAAA 

GTCTCAGTTTGAAGAACAAATTAAGGCAATTAAAAATGGTTCTCGGCTCAGTGAACTTTCTAAAGTGCAG 

ATTTCTGAGCTTTCATTTCCTGCCTGTAACACGGTTCATCCCGAGTTACTCCCTGAGTCTTCAGGCCACG 

ATGGCCAAGGGCTTGTGACTTCTGCAAGCGACGTGACTGGAAACCACGCAGCACTTCACAGGGATCCTAG 

TGTGTTCTCTGCTGGTGATTCCCCAGGGGAGGCTCCTTCTGCGCTGTTGCCAGGGCCACCCCCTGGTCAG 

CCTGAAGCCACTCAGCTGACAGGGCCAAAACGGGCTGGCGAGGCAGCTCTGTCAGAACGAAGCCCTGTGG 

CTGATCGGAAGCAGCCTGTTCCTCCAGGACGTGCTGCGCGTT 

TAGTATTATTGAGCACCTGTCAGTGGTATTCCCATGTTACAACAGC^CTGAGCTTGC 

AAAGTGCGAAG CAAAAACAAG AACTCACT CT CAGGATTGAGTATTGATGAAATTGTCCAAAGAGTGAGA.G 

AACA(^TTCTAGATGAACAGAAAAAGAAAAAGCCAAACCCAGGAAAGGACAAGAGGACTTATGAGCCCAG 

CTCTGCCACCCCCGTGACCAGGTCCTCCCAGGGCTCACCOT 

AAGGGGCAGAAAGCAGAAGATGTCCCTGTGAGGATTGCACTGGGTGCAAGTTCCTGTGAAATATGCCACG 
AGGTGTTCAAATC^AAAAACGTGCGTGTGCTCAAATGTC^ 

GTGGCTTAAAGGGCAGAGCGCTTGCCCGGCCTGCCAGGGTCGTGATCTCCTGACAGAAGAGTCACCTTCT 
GGAAGAGGCTGGCCCAGTCAGAATCAGGAGCTGCCTTCCTGCTCTTCTAGGTAGTCACACTTCACTAAAG 
TGTCATCCACCAGTGTGTTGAATCCGAAGAATGACAATTTTC^ 

AAGACCCTTGTGCATTGTGTGTCACAAAGCTAAATACATGGAAATCGTTAATATCGCTGATATTAAGTAA 
TTTCCCCACTCTGAGTGAATACTTTGATGATTGCCAACAGTGGCTAATAAAATGACGGCTACCACACTCA 
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TGGGTCACTGGGCTGCGCAGGGCTOT 

AGCAGTATTCTAGTGATAAGAATTCTTAACATAGCCAA 

CTTTTCTGTTTGAAAAACCTGTTCTGG 

AAGAGTCTGTATTCCTGATATGCCTATATTTTTCCTCAA^ 

AACTATATTTTAATAATTTATAGTTAATGTTA 

TCTTTGTGAAATCCCATCTGCATTTGATT^ 

TTGCTTTTCATCTTTTATAGATGTAATCIXSATTTTCAAAAATC^ 
AAGACTTTTTCCCCCTGGAATCGAGGCTGTGTGTC 
AACTCCGCTGCCTTCCTTAGCAGCTTCTGTCCT 
TCCAGCCGTGCTXIAGCACACAACAGGCTGTC^ 

CACAAAAAGAGAAGCTGTGTCTTTAGACAACCCTGAGGTATCTGTGTTACAATCGTO 

TTGTGTAAAGTATGCATGCAGTCTTGTACTGTGA 

AAAAAATTAGATATTGTTTTGTGACTTTTAGAG^^ 

CATTTCTCTCCAACATGAAGGATCAAAAAATGTTTTTCAAT 

GTCATGAGTTTATGAGGCTGGATTTGGGCACCTTC 

TAGTGACTTAGGTCTTAGAAGCTC^CGCCTTAGTTTGAAACAGATTCTCC^ 
GTCTGCATATCCATAAGAATTGAACGCTATGGGTGOT 
ACAAAAGGAGAAGAGGAACATCCGTTGAATGAGTGTC 
CCTTATATTTGTCCAACAACTGTCAGAATAAAGAACATTCTAAAATGAG 

Human TTC3 Protein sequence - varl (public gi: 2662364) (seq id no: 308) 

I KI NI FWPLLFQHQNS SVI SRLHPCVDANNSRASE INLKKLQHLELMEDI VDLAKKVANDS FLI GGLLRI 
GCKI ENKI LAMEEALNWI KYAGDVTI LTKLGS IDNCWPMLS I FFTE YKYHI TKI VMEDCNI1L1EELKTQS C 
MDCIEEGEIiMKMKGNEEFSKERFDIAIIYYTRAIEYR^ 

KNTWPKGHYRYCDALSMLGEYDWALQANI KAQKLCKNDPEGI KDLIQQHVKLQKQIEDLQGRTANKDPI K 
AFYENRAYTPRSLSAPIFTTSLNFVEKERDFRKINHEMANGGN^ 

PPKHKGKQKSRNNESEKFSS SS PLTLPADLKNI LEKQFSKSSRAAHQDFANIMKMIiRSLI PDGYMALLEQ 
RCRSAAQAFTELLNGLDPQKI KQLNLAMINYVLVVYGIiM 
DCIiAYCGIGKVYTjKKNRFLEAIJtfHFEKARTLIYR 

PVPDAICCYQKCHGYSKIQIYITDPDFKGFIRISCC^YCKIEFHMNCW CL 

TPDCEGVISKII I FSSGGEVKCEFEHKVIKEKVPPRPIIjKQKCSSLEKLRLKEDKKL^ 

QERMEEDLRESNPPKNEEQKETVDWQRCQFLDDRIIiQCIKQYADKIKSGIQNTATIiLK^ 

DYTTCFS SRNFLNEAVD YVI RHLI QENNRVKTRI FLHVIjSEIjKEVEPKIiAAWI QKLNS FGIiDATGTFFSR 

YGASLKLLDFSIMTFLWNEKYGHKLDS I EGKQLDYFSEPASLKEARCLI WIjLEEHRDKFPALHSAIiDEFF 

DIMDSRCTVIjRKQDSGEAPFSSTKVKNKSKKKKPKDSKPMLVGSGTTSVTSNNEIITS 

gpfavpdhlrqdveefealydqhsirayvvrnkki.wd^ 
yeffpeetrqilekagglkpfllgcprfwidncialkkvasrlkk^ 

LQLNPAAREFKPDVKS KPVSDSS SAPAFENVKPKPVS ANS PKPACEDVKAKPVSDNS SRQVSEDGQPKGV 
SSNSPKPGSEDANYIOIVSCNSPKPVLEDVKPTYWAQSHIjVTGYCTYLPFQRFDITQTPPAYINVLPGLPQ 

ytsiytplaslspeyqlprsvpwpsfvandradknaaayfeghhi^a 
vkshcstgdahtvlsesnrndehcgnsnnkc^ 

feerqgeisriekehqvwdqi^evyenyeqiklkgleetrdleeklkrhleenkiskteldw 

eikkwqqekkeiqerlkslkkkikkvsnasemytqk^^ 

kgkedyeeshqravaaevsvkenwkesevyklqim^ 

SNVTKVIEKAKSQFEEQIKAI KNGSRIiSELSKVQI SELSFPACNTVHPELLPESSGDDGQGIiVTSASDVT 

gnhaalhrdpsvfsagdspgeapsallpgpppgqpeatqltgpk^gqaalserspvtdrkqpvppgraa 
rssqspkkpfnsiiehlswfpcjtnstelagfikkvrskn^ 
pgkdkrtyepssatpvtrssqgspsvvvapspktkgqkaedv^ 
ghkyhkgcfkqwlkgqsacpacqgrdliiteespsgrgwpsqnqelpscssr 

Human TTC3 Protein sequence - var2 (public gi: 1632766) (seq id no: 309) 

MLGE YDWALQANT KAQKLCKNDPEGIKDLI QQHVKLQKQI EDLQGRTANKDP I KAFYENRAYTPRSLS AP 

IFTTSLNFVEKERDFRKINHErmNGGNQNLKVADEALKVDDCDra 

KFSSSSPLTLPADLKNIIiEKQFSKSSRAAHQDFANIMK^ 

DPQKI KQLNLAMINYVLVVYGIiAI SLI1GIGQPEELSEAENQFKRIIEHYPSEGLDCLAYCGIGKVYI1KKN 

RFLEALNHFEKARTLIYRLPGVIjTWPTSNVIIEESQPQKIKMLLEKFVEECKFPPVPDAI 

KIQI YITDPDFKGFIRISCCQYCKIEFHMNCWKKLKTTTFNDKIDKDFLQGICLTPDCEGVISKI 1 1 FSS 

GGEVKCEFEHKVIKEKVPPRPILKQKCSSLEKLRLKEDKKIjKRKIQKKEA 

EEQKETVDOTQRCQFI^DRILQCIKQYADKIKSGIQOTAM^ 

DYVIRHLIQEimRVKTRIFriHVIiSELKEVEPKLAAWIQKLNSFGI^ 

WNEKYGHKLDS I EGKQLD YFS E PASLKEARCLI WLLEEHRDKFPALHS AIODE FFD I MDSRCTVIiRKQDSG 
EAPFSSTKVKNKSKKKKPKDSKPI^VGSGTTSVTSNNEIITSSEDHSNRNSDSAGPFAVPDHLRQDVEEF 
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EALYDQHSNEYVVRNKKLWDMNPKQKCSTLYDYFSQFLEEHGPI^ 

GIjKPFLLGCPRFVVIDNCXALKKVASRLKKKRKKKNI KTKVEE I SKAGE YVRVKLQIjNPAARE FKPDVKS 
KPVSDSSSAPAFENVTOKPVSANSPKPACEDVT^^ 

VSCNSPKPVLEDVKPTYWAQSHLVTGYCl^PFQRFDITQTPPAYINVLPG 
LPRSVPVVPSFVANDRADKNAAAYFEGHHLNAENVA 

SNRNDEHCGNSNNKCEVI PESTSAVTNI PHVQMVAIQVSWNI IHQEVNTEPYNPFEERQGEISRIEKEHQ 
VIjQDQLQEVYENYEQ I KXrKGIiE ETRDLE EKLKRHLEENKI S KTELtDWFLQDIiERE I KKWQQE KKE I QERL 
KSLKKKI KKVSNAS EMYTQKNDGKEKEHELHIiDQSIiE I SNTIiTNEKMKI EEYI KKGKEDYEESHQRAVAA 
EVSVLENWKESEVYKIjQIMESQAEAFLKKIjGLISRDPAAYPDM^ 
QIKAIKNGSRLSELSKVQISELSFPACNTVHPELLPESSGHDGQGLVTSA^ 

DSPGEAPSALLPGPPPGQPEATQLTGPKRAGQAALSERSPVADRKQPVPPGRAARSSQSPKKPFNSIIEH 
LSVVPPCYNSTEIAGFIKKVRSKNKNSLSGIiSIDEIVQ 
TRSSQGSPSVWAPSPKTKGQKAEDVPVRIALGASSCEICHEVFKSKNVRV 
SACPACQGRDLIiTEESPSGRGWPSQNQELPSCSSR 

Human TTC3 Protein sequence - var3 {public gi: 1632764) (seq id no: 310) 

MKMKGJSTEEFSKERFDIAIIYYTRAIEYRPENYL^^ 

RYCDALSMLGE YDWALiQAiTJ KAQKLCKNDPEGI KDLI QQHVKLQKQ I EDLQGRTANKDP I KAFYENRAYT - 
PRSLSAPIFTTSIiNFVEKERDFRKZNHEMANGGNQNL 

SRNNESEKFSSSSPLTLPADLKNILEKQFSKSSRAAHQDFANIMKMLRSLIQDGYMAI^ 
TEIiLNGLDPQKIKQLNIjAMINYV^ 

KVYLKKNRFLEALNHFEKARTLIYRLPGV^ 

•QKCHGYSKIQIYITDPDFKGFIRISCCQYCKIEFHMNCWKKIjKTTT^ 

Kill FSSGGEVKCEFEHKVI kekvpprpi LKQKCS SLEKLRLKEDKKLKRKI qkkeakklaqermeedlr 
ESNPPKNEEQKETVDNVQRCQFIJ3DRILQCIKQYADKIKSGIQOTAM 

NFLNEAVD YVI RHL I QENNRVKTRI FLHVXiSELKEVE PKLAAW I QKLNS FGLDATGTFF SRYGAS LKL LD 

FSIMTFLWNEKYGHKLDSIEGKQLDYFSEPASLKEARCLIWLLEEHRDKFPALHSAI^ 

LRKQDSGEAP FS STKVKNKS KKKKPKDSKPMLVGSGTTSVTSNNE 1 1 TSSEDHSNRNSDSAGP FAVPDHL 

RQDVEEFEALYDQHSNEYVVRNKKIjWDMNPKQKCST^ 

QILEKAGGLKPFLLGCPRFWIDNCIALKKVASRI4KKKRKKKNI KTKVEEI SKAGEYVRVKLQLNPAARE 
FKPDVKSKPVSDSSSAPAFENVKPKPVSANSPKPACEDVKAKPVSDNSSRQVSEDGQPKGVSSNSPKPGS 
EDANYKRVSCNSPKPVLEDVKPTYWAQSHIjVTGYCT^ 

SLSPEYQLPRSVPVVPSFVANDRADKNAAAYFEGHHLNAENVAGHQIASET^ 
AHT^SESNRNDEHCGNSNNKCEVIPESTSAVTOIPHVQMVAIQVSWNIIHQEVOT 

RIEKEHQVIiQDQLQEVYENYEQI KIiKGLEETRDLEEKLKRHLEENKI SKTELDWFLQDIjERE I KKWQQE K 
KE I QERLKS LKKKI KKVSNAS EM YTQKNDGKEKBHELHIjDQSLE I SNTLTNE KMKI EEYI KKGKED YEES 
HQRAVAAEVSVLENWKESEVYKLQIMESQAEAFLK^ 

AKSQFEEQ I KAI KNGSRLSELS KVQI SELS FPACNTVHPELLPESSGHDGQGLVTSASDVTGNHAAIjHRD 
PSVFSAGDSPGEAPSALLPGPPPGQPEATQl/TGPKRAGQAALSERS PVADRKQPVPPGRAARSSQS pkkp 
FNSIIEHLSWFPCYNSTEIAGFIKKVRSKNKNSLSGL^ 
PSSATPVTRSSQGSPSVWAPSPKTKGQKMDVPVRIAUSASSCEIC^ 
KQWLKGQSACPACQGRDL1LTEESPSGRGWPSQNQEI1PSCSSR 

Hiunan TTC3 Protein sequence - var4 (public gi: 1632762) (seq id no: 311) 

MDNFAEGDFTVADYAIiLEDCPHVDDCVFAAEFMSNDYVROT^ 

CSKPISVIiQDYCDAI KINIFWPLLFQHQNSSVISRLHPCVDANNSRASEINXjKKLQHLEL^ 

VANDS FLIGGLLRIGCKIENKI LAMEEALNWI KYAGDVTILTKLGS IDNCWPMLSI FFTEYKYHITKIVM 

EDCmiLEELKTQS(^CIEEGELMKMKGNEEFSKERFDI^^ 

FRNALGDGKRATILKNTWPKGHYRYCDALSMLGEYDWALQANIKAQKLCKN^ 

EDLQGRTANKDPI KAFYENRAYTPRS LS AP I FTTSLNFVEKERDFRKINHEMANGGNQNLKVADEALKVD 

DOTCHPEFSPPSSQPPKHKGKQKSRNNESEKFSSSSPLTLPADI1KNII1E 

RSLIQDGYMALLEQRCRSAAQAFTELLNGI^PQKIKQLNLAMINYVLW 

QFKRI I EHYPSEGLDCLAYCGIGKVYLKKNRFLEALNHFEKARTLI YRLPGVLTWPTSNVI I EESQPQKI 
KMLLEKFVEECKFPPVPDAICCYQKCHGYSKIQIYITDPDFKGFIRISCCQYCKIEFH^ 
NDKIDKDFLQGICLTPDCEGVISKI II FSSGGEVKCEFEHKVIKEKVPPRPIL^ 
LKRKIQKKKAKKLAQERI^EDIiRESNPPKNEEQKETVDNVQRCQFLDDR 

LLKELLS WKVLSTED YTTCFS S RNFLNEAVDYVI RHLI QENNRVKTRI FLHVLSELKE VEPKLAAWI QKL 
NSFGLDATGTFFSRYGASLKLLDFS IMTFLWNEKYGHKLDS IEGKQIiDYFSEPASLKEARCLI WLLEEHR 
DKFPALHSALDEFFDir^SRCTVLRKQDSGEAPFSSTKV^ 

TSSEDHSNRNSDSAGPFAVPDHLRQDVEEFETUliYDQHSNEYVVRNKKLWDMNPKQKCSTLYDYFSQFLEE 
HGPLDMSNKMFSAEYEFFPEETRQILEKAGGLKPFLLGCPRFWI^^ 

VEE I S KAGE YVRVKLQIiNPAAREFKPDVKS KPVSDSS SAPAFENVKPKPVS ANSPKPACEDVKAKPVSDN 
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SSRQVSEDC^PKGVSSNSPKPGSEDANYKRVSCNSPK^ 
TPPAYINVLPGLPQYTSIYTPLASLSPEYQLPRSVPVVPSFV^ 

ASETQILEGSIjGISVKSHCSTGDAHTVLSESNRNDEHCGNSNNKCEVI pestsavtni phvqmvaiqvsw 
Nil HQEVNTE P YNP FE ERQGE I S R I EKEHQVLQDQLQEVYENYEQ I KLKGIiEETRDIiEEKLKRHLEENKI 
SKTELDWFIX^DLEREIKKWQQEKKEIQERIjKS^ 

TLTNEKMKI EE YI KKGKEDYEESHQRAVAAEVSVLENWKESEVYKLQIMESQAKAFLKKLGL SRDPAAY 

PDMESDIRSWELETiSNVTKEIEKAKSQFEEQIKMKNGSR 

HDGQGLVTSASDVTGNHAALHRDPSVFSAGDSPGE^ 

VADRKQPVPPGRAARS SQS PKKPPNSI IEHLSVVFPCYNSTELAGFI KKVRS KNKNSIiSGLS IDE I VQRV 
TEHILDEQKXKKPNPGKDKRTYEPSSATPVTRSSQGSPSVWAPS 

HEVFKS KNVRVIiKCGHKYHKGCFKQWLKGQSACPACQGRDLLTEES PSGRGWPSQNQELPS CSSR 

Human TTC3 Protein sequence - var5 (public gi: 2969903) (seq id no: 312) 

DLKKLQHLELMEDIVDLARKVAN^ 

WPl^SIFFTEYKYHITKIVMEDCNIiLEEIiKTQSCm 

RPENYLLYGNRALCFPRTGQFRNALGDGKRATILKNTWPKGH^ 

NDPEGIKDLIQQHVKLQKQIEDIXX^RTANKDPIKAFYENRAYTPRSLSAPIFTT 

EMANGGNQNLKVADEALKVDDCDCHPEFSPPSSQPPI^ 

QFSKSSRAAHQDFANIMKMI^SLIQTCYMAL^ 

GLAISLLGIGQPEELSEAENQFKRIIEHYPSEGIjDCIjAYCGIGKVYLKK 
GVLTWP 

Unigene Name: UBE2 N Unigene ID: Hs. 458359 

Human UBE2N mRNA sequence - varl (public gi: 37577134) (SEQ ID NO: 208) 

CGCGCGCGCAGTCGCGCGCGGGTCGTGCCGTACCACCGTCGCGGGCAGGCTCGGCCACGAGCGCCAGAGC 

CCCGC^CCTCCCCTCGCGGCCTGTCCCAAGTCCCTGCCCCGCAACAGAGCGTCACnTCCGCCATCCCC 

CAGCGGTTGGGGCGGGGCGCACGGGGGAGGGGGCCAGGTCGGAGGGAAGCCCGCCCGTGCCCGAGCCCGC 

GCCCGAGCAGGGAGTACATTTCCCGAGGGGCCTCGGCGGCGGCTGCGGCGACGGGCGCGGCAACGTCCCC 

CGGAAGTGGAGCCCGGGACTTCCACTCGTGCGTGAGGCGAGAGGAGCCGGAGACGAGACCAGAGGCCGAA 

CTCGGGTTCTGACAAGATGGCCGGGCTGCCCCGCAGGATCATCAAGGAAACCCAGCGTTTGCTGGCAGA^ 

CCAGTTCCTGGCATCA^GCCGAACCAGATGAGAGCAACGCCCGTTAITT 

CTCAGGATTCCCCCTTTGAGGGAGGGACTTTTAAACTTGAACTATTCCTTCCAGAAGAATACCCAATGGC 

AGCCCCTAAAGTACGTTTCATGACCAAAATTTATCATCCTAATGTAGACAAGTTGGGAAGAATA 

GATATTTTGAAAGATAAGTGGTCCCCAGCACTGCAGATCCGCACAGTTCTGCTATCGATCCAGGCCTTGT 

TAAGTGCTCCCAATCCAGATGATCCATTAGCAAATGATGTAGCGGAGCAGTGGAAGACCAACGAAGCCCA 

AGCCATAGAAAGAGCTAGAGCATGGACTAGGCTATATGCCATGAATAATATTTAAATTGATACGATCATC 

AAGTGTGCATCZACTTCTCCTGTTCTGCCAAGACTTCCTCCTCTTTGTTTGCATTTAATGGACACAGTCTT 

AGAAACATTACAGAATAA?UUUVGCCCAGACATCTTCAGTCCTTTGGTGATTAAATGCACATTAGCAAA 

TATGTCTTGTCCTGATTCACTGTCATAAAGCATGAGCAGAGGCTAGAAGTATCATCTGGATTGTTGTGfcAA 

ACGTTTAAAAGGAGTGGCCCCTCCCTGCTTTTATTCATTT 

TGAATGAAGGTAGTTGTCAGGTTAGCTGCAGGGGTGTGGGTGTTTTTATTTTATTTTATTTTATTTTAOT 

-TTTGAGGGGGGAGGTAGTTTAATTTTATGGGCTCCTTTCCCCCTTTTTTGGTGATCTAATTGCATTGGTT 

AAAAGCAGCTAACCAGGTCTTTAGAATATGCTCTAGCCAAGTCTAACTTTATTTAGACGCTGTAGATGGA 

CAAGCTTGATTGTTGGAACCAAAATGGGAAGA.TTAAACAAACATCACAGCCCTCACTA 

TCAAGTGTAGATTCCCCCCTTCAAAAAAAGCTTGTGACCATTTTGTATGGCTTGTCTGGAAACTTCTGTA 

AATCTTATGTTTTAGTAAAATATTTTTTGTTATTCTACTTTGCCTTTGTACAGTTTATTTTACTGTGTTT 

ATTTCATTTTCCCAATTTGACAATCGTATTTTAAA 

CCATCTGACAAATTGAATGGCAAGAGGTGGATTTTGCCAGTTT 

GATAGTACTGAATGGAGTATTTATAAACTGGCCCTGAGCATGCATAAAGCATCAGTATCTGACCTTTTTT 

TAACCTTCTAGGAATTTGAAATAAA.TGTGTTTGTGTTGTCTGATTAGATGATCATTGGTGTCTTGCCACA 

ATGTTTAA?^TTACTGTACAGGAAAGTCA(^GCAAAGATAGCAGTTGTGACTGACATGTAGGACTTTCA 

CAGTTGTGCCACATTTTTGCCTAAAATTTGGGTTATGACATTTTTCTTGGTTCTTATCTGA?V7^TTTCAT 

CTGTAACCTTTCATGTGTGTTAAGAAAGACTGATCTGATCATTTGGGATTTGCTGAGGCATTTGTGAGTC 

TTCCTTATAAACCTGATGAGCAGATCTCAACTATCTAGCTTGTGTGTCATCAGAAAGGTTTATCCCTTTG 

AGAGTATCAAGTCCTCAGTTAA.TGATTCTTGCTTTCATCCCTCCAGTATTTGCTGTGGGAGCT 

TTCTTTAATTTGGAATTCAGTAATTTTTCTTC^ 

CCCACCTCTCTCCATATCTAATTCCTGATTCTTGTTATTTTTAAGTCATAAATGTAGCCAGTCATAAATA 

CATAAATGTTAACCTTCGGGTTGCAACCTTGTCTCTTGCAGTTTAAGGTAATGGATATTGTAGCCCATTT 

GAATTTTCTTCACTCTTATTCTCGTAATTCTGGAGTTTCTTCAGATTGTGGTGTATO 

ATGTAAGATGAAGAATTAACTATTAAAATTACATTTTCAACAT^ 

GTATCCTTCCAAATAAATGCATTGCTTGGTAAAAAAAAA/^AAAAAAAA 
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Human UBE2N protein sequence - varl (public gi: 4507793) (seq id no: 313) 

MAGLPRRIIKETQRLIiAEPVPGIKAEPDESN^ 

FMTKI YHPNVDKLGRI CUD I LKDKWS PALQIRTVLLS I QAKLSAPNPDDPLANDVAEQWKTNEAQAI ETA 
PAWTRLYAMNNI 



Human UBE2N pray sequence - varl (SEQ ID NO: 209) 

GCCGCCATGGNGTACCCATACGACGTACC^ 
CAAGCAGTGGTATCAACGCAGAGTGGCCGAACTC^ 

C^TCAAGGAAACCC^GCGTTTGCTGGCAGAACCAGTTCCTGGCATCAAAGCCGA^ 
GCCTOTTATT1TCATGTGGTCATTGCTGGCCCTCAGGATTCCC 
AACTATTCCTTCCAGAAGAATACCCAATGGCAGCCCCT 
ACAGTTCTGCTATCGATCCAGGCCTTGTTAAGTGCTCC 

CGGAGCAGTGGAAGACCAACGAAGCCCAAGCCATAGAAACAGCTAGAGCATGGACTAGGCTATATGCCAT 

GAATAATATTTAAATTGATACGATCATCAAGTGTGCATCACTTCTCCTGTTCTGCGAAGAOT 

TTTGTTTGCATTTAATGGACACAGTCI^AGAAACATTAC^ 

TNGGTGATTAAATGCACATTANCAAATNTTGCQ 

AAATTTNATCTGGATNNTNGGAAACnsrTTNAAAAC^ 

•NTNAANTTAAACC OTGGAATNANGGNNTTTNCNGNNACNNNGGG GGT 

Human UBE2N pray sequence - var2 (SEQ ID NO: 210) 

CGAGOSCCGCCTGGNNTACCC^TACGACGTACCA^ 

TCCACCC^GCAGTGGTATCAACGCAGAGTGGC(^TTATGGCCGGGGGAGAGGAGCCGGAGACGAGAC(^ 
GAGGCCGAACTCGGGTTCTGACAAGATGGCCGGGCTGCCCCGCAGGATCATCAAGGAAACCCAGCGTTTG 
CTGGC^GAACCAGTTCCTCGCATCAAA 

TTGCTGGCCCTCAGGATTCCCCCTTTGAGGGAGGGACTTTTAAACTTGA^ 

CCCAATGGCAGCCCCTAAAGTACGTTTCATGACCAAAATTTATCATCCTAATGTAGAC^^ 

ATATGTTTAGATATTTTGAAAGATAAGTGGTCCCCAGCACTGCAGATCCGCACAGTTCTGCTATCGATCC 

AGGCCTTGTTAAGTGCTCCCAATCCAGATGATCCATTAGCAAATGATGTAGCGGAGCAGTGGAAGACCZ^ 

CGAAGCCCAAGCCATAGAAACAGCTAGAGCATGGACTAGGCTATATGCCATGAATAATATTTAAATTGAT 

ACGATCATCAAGTGTGCATCACTTCTCCTGTTCTGC 

CAC^GTCTTANAAACNTTNNGAATAAAAANC^ 

NNNTTNTNTTGNCNGNTNCOTGNNTAAACCTO 

ANNNTGNCCCCCNNNTCT^ 

GGGNGGGNTTTTTTTTTTTTTTTTTNTN 



Unigene Name: UNC84B Unigene ID; Hs , 406612 

Human UNC84B mRN A sequence - varl (public gi: 31742497) (SEQ ID NO: 211) 

CCGCCCGCCCGCCTTGTCCCGCGTCGCCGCTCTTCGCGTGCCCGCGCGCCCCGGGCCCGGCCGCTGTGTC 
GCCCTGAGCGGAGCGCCGCCCGCGGGATCCCCACGCGGAAAGGGGGCGCGCCCCGGCGGCGGCCTGGCCT 
CGGACGCCCCCGGCCGGGCTAGAAGCCGCCGCGGCAGCAGATTCTCTTCAGGGGAAGAGTCCACATCCCA 
CCTCATCATGTCCCGAAGAAGCCAGCGCCTCACGCGCTACTCCCAGGGTGACGATGACGGCAGCAGCAGC 
AGCGGAGGGAGCTCGGTGGCTGGGAGTCAGAGCACCCTGTTTAAA 

GGAAATCCAGCAAC^TGAAGCGCCTGTCCCCAGCGCGACAGCTGGGCCCGTCCTCTGATGC^ 
CTACTACAGTGAGTCGCTGGTCCACGAGTCCTGGTTCCG^ 

GACGCCAACTGGGGTGAGGACCTGCGGGTGCGGAGGAGGAGAGGCACGGGTGGCTCAGAGAGC^GGAGOT 

CCAGCGGGCTTGTGGGGCGCAAGGCGACCGAGGACTTCCTGGGCTCTTCCTCGGGCTACTCCTCTGAGG 

CGACTACGTGGGCTACTCGGATGTGGACCAGCAGAGTTCCAGCTCGCGGCTCCGAAGCGCCGTCTCACGG 

GCGGGCTCCTTACTCTGGATGGTGGCCACTTCGCCAGGCCGGCTCTTCAGACTTCTCTACTGGTGGGCTG 

GCACCACCTGGTACCGCCTGACCACAGCTGCCTCCCTCCTTGACGTCTTCGTTTTAACCAGGCGC^ 

GTCCCTGAAGACGTTCCTCTGGTTCCTGCTGCCGCTO 

TATTTCTACCCCTATGGGCTGCAGACATTCCACCCT 

GGAGGCCGGATGAGGGCTGGGAAGCCAGAGACTCATCGCCACATTTCCAGGCTGAGCAGCGTGTTATGTC 
CCGGGTACACTCTCTGGAGCGGCGTCTGGAAGCTCTTGCTC 

GCCATGCGGCTGGAACGTCTGGAGCTGCGGCAAGGGGCTCCTGGCCAGGGAGGTGGTGGTGGCCTGAGCC 
ACGAGGAC^CCCTGGCGCTGCTGGAGGGGCTAGT^ 

CAGGGAAACTGCTGCTCGCATCCAGGAAGAACTGTCTGCCCTGAGAGCAGAGCATCAGC^GACTCAGAA 
GACCTCTTCAAGAAGATCGTCCGGGCCTCCCAGGAGTCCGAGGCTCGCATCCAGCAGCTGAAGTCAGAGT 

X 



Figure 36 part - 135 



180/202 



WO 2004/078130 



PCT/US2004/006308 



XK5CAAAGCATGACCCAGGAGTCCTTCCAGGAGAGCTCTGTC 

GGCCGGCCTGCAGCAGGAGCTGGCGGCTCTGGCACTGAAGCAGAGCTCGGTGG 

CTGCCCCAGCAGATCCAGGCCGTGCGGGACGACX3TGGAATCTCAGTTCCCGG^ 

ITGCCCGAGGTGGAGGGGGCCGCGTGGGGCTCCTTCAGAGAGAGGAGATGC^ 

GGAGAGCAAGATCCTCACCCATGTGGCAGAGATGCAGGGCAAGTCGGCCAGGGAAGCCGCGG^ 

AGCCTGACGCTGCAGAAAGAAGGTGTGATTGGAGTGACAGAGGAGCAC^ 

CCCTGCAGCGCTAGAGTGAGGACCGCATCGGGCTGGCAGACT^ 

CATCAGCACCCGATGTTCTGAGACCTACGAGACC^GACGGCCCTC 

TGGTACCACTCCCAGTCACCCCGAGTCATCCTCCAGCCAGATGTGCACCCAGGCAACTGCTGGGCCTTCC 

AGGGGCGACAAGGCTTCGCCGTGGTCCX3COTCTCTGCCCGCATCCGCCCCACAGCCGT 

TGTGCCCAAGGCCTTGTCACCCAACAGCACTATCTCCAGTGCCCCCAAGGACTTCGCCATCTTTGGGTTT 

GACGAAGACCTGCAGCAGGAGGGGACACTCCrrGGCAAGTTCACTTACGATCAGGAC^ 

AGACGTTTCACTTTCAGGCCCCTACGATGGCCACGTACCAGGTGGTGGAGCTGCGG^ 

GGGCCACCCCGAGTACACCTGCATCTACCGCTTCAGAGTGCATGGGGAGCCCGCCCACTAGCCCTGCTT^ 

CTGGTGCCTGCTGCCAGCCATCTGGGAGTGGGTGAACAGCACCCCGCCGCTTCCCCCACACGCTTGCTCG 

G CGCTCTGACTTCTAGG AGCACAAGAGAGGAGCCTGTGGC CCGATC 

TCCTGAGCAGCAGGTGGCTCGAGGCGGTTAGCAGGCTCCAG 

GTGGCGTCTGCTTCCCATCCTGGGAGTGTGTATATATGTAGCATATCATGGGGGACTGGGAAGTTGGGAG 

AGGTAGGACCTGACTGGTCTTGGCTGGGGTCAGGGGCTGGTGCCTGGGAGCTGATGAAGCAGGTGCCAGG 

GCTGTGGGAGGGGCAAGCTACGGCCTGGGCTAGGTGAGCTGCCTCTGCCCCTGGGCAAGGAAGCGAGGCC* 

CTCTGGGAGGAGGGTGCTTAGCTCCAGAGCAGGATGGGACTTCCCCAGGCAGGAAGCACTTGATGGAGAG 

CTGCCCAGCTCTCCTACAAGGTTAGTGCCCTCCACCTAGGGAAGCATGAACCACAGGGTCCCTGAGGGCC 

TTCGACAAAAGTGTGTATTTGTCCCGGGGAGGGTAGCAGTGGGCCATGGGGCTTCTTGTGCCCTAAAGGG 

GACTGGCTGCTGTGATCTTCTAAGGGGCCCAGGGCCAACCCTGTAGGCTTCCCCTCTGCTGGGGACGGTA 

GTTGCTTTTCTCTCTCCTGATGCTAGGTTGGGGCCCACCCTGCTCCCTGTTCCTGCTAGGGCCTGCCAGT 

GCCCCTX^GCTTGCTTTCCACATTCTCCCAGGGTATGGAGACCTAGACCTGTCTTTGGGGCCATTAGCAT 

CTGGGGTTATAGCAAGAAGAGTGGGGAGCATGGAACTCCTGGGCTCTTGTGGGGACGTTCAGGGTATCGG 

GGTGCGAGGTCTGTCTGCACCGGCCCCCACATCTAACCAGGCCCTGATGTAGGGGTCGTCCGCTCAGGCT 

GCCCCCTTGGGCTCTTGCAGCTCTTGTTC^GGTAGTCGCCCTTCTGGTTTGTTCTCTGTGGGGCAGTTGG 

TGGGGGCTGGGGGAAGAGGCTGGCAGAAGTTACCCTGGATAGGGAAGGGGGAGGAGGGGACTTTTAGAGC 

CAGCAGGCCCCACTGTATTATGTATATATTTTTCAAGGTCTGTTTTTCTAACTGAAAAGCTAAG 

ATTCCTAGCCCCGTTCTGTGGGGCACTGGGTGATACTCAGTTTCTTGTTCCTGGCCGTGGAGAGGGGCCT 

GGGGCACTGGTTCCGGCTGTGTCTGGTGGTCTGGCTGCGGGGAAGGGGCAAGAAGGCGGGCAGGCCTTCA 

CTGGAGCACTGAGCCTCAAATCCGCTCTGGAGCATGAGGCTC 

CCATCCCGAGGCAGCCAGGGTTTGTTTTGCGCTCTCCTGTCACAAATGCTGCACTATTGGTTCTTAAGTT 
TTTTATCTCCAGATCCTAATTTATGCCTATGCAAAAAATAAATGACGCGCAAGAGCTG 



Human UNC84B protein sequence - varl (public gi: 31742498) (seq id no: 314) 

MSRRSQRIiTRYSQGDDDGSSSSGGSSVAGSQSTLFKDSPLRTLKRKSSNMKRLSPAPQLGPSSDAHTS^ 
SESI4VHESWFPPRSSI1EELHGDANWGEDLRVRRRRGTGGSESSRASGLVGRKATEDFLGSSSGYSSEDDY 
VGYSDVDQQSSSSRLRSAVSRAGSLLWMVATSPGRLFRLLYVWAGTTWYRIiTTAASLLDVFVLTRRFSSL 
KTFLWFLLPIjLLIjTCLTYGAWYFYPYGIjQTFHPALVSWWAAKDS rrpdegweards sphfqaeqrvmsrv 
HSLERRLEAIiAAEFSSNWQKEAMRLERLELRQGAPGQGGGGGLSHEDTLAIiLEGLVS 

TAARIQEELSALRAEHQQDSEDLFKKIVRASQESEARIQQLK^^ ■ 
LQQEIjAALAIjKQSSVAEEVGLLPQQIQAVRDDVESQ 

KILTHVAEMQGKSAREAAASLSLTIxQKEGVIGVTEEQVHHIVKQALQRYSEDRIGLADYA^ 
TRCSETYETKTALLSLFGIPLWYHSQSPRVILQPDVHPGNCTAFQGPQGFAVVRLSARIRPTAVTLEHVP 
KAIiS PNSTI SSAPKDFAI FGFDEDLQQEGTLLGKFTYDQDGEP I QTFHFQAPTMATYQWELRI LTNWGH 
PEYTCIYRFRVHGEPAH 

Human UNC84B pray sequence - varl (SEQ ID NO: 212). 

GATTTGGNAATNCTACAGGGNATGTTTAATACCACTACAATGGATGATGTATATAACTATCTATTCGATG 
ATGAAGATACCCCACCAAACCCAAAAAAAGAGATCTTTAATACGACTCGACTATAGGGCGAGCGCCGC^ 
TGGAGTACCCATACGACGTACCAGATTACGCTCATATGGCCATGGAGGCCAGTGAATTCCACCCAAGCAG 
TGGTATCAACGCATAGTGGAAAAGCATGAGCCAGGAGTCCTTCCAGGAGAGCTCTGTGAAGGAGCTGAGG 
CGGCTGGAGGACCAGCTGGCCGGCCTGCAGCAGGAGCTGGCGGCTCTGGCACTGAAGCAGAGCTCGGTGG 
CGG7^AGAAGTGGGCCTGCTGCCCCAGCAGATCCAGGCCGTGCGGGACGACGTGGAATCTCAGTTCCCGGC 
CTGGATCAGTCAGTTCCTTGCCCGAGGTGGAGGGGGCCGCGTGGGGCTCCTTCAGAGAGAGGAGATGCAA 
GCTCAGCTGCGAGAGCTGGAGAGCAAGATCCTCACCCATGTGGCAGAGATGCAGGGCAAGTCGGCCAGGG 
AAGCCGCGGCCTCCCTGAGCCTGACGCTNCANAAAGAAGGTGTGATTGGAGTGACAGAGGAGCAGGTGCA 
CCACATCGTGAAGCAGGCCCTGCAGCGCTACAGTGAGGACCGCATCGGGCTGGCAGACTACGCCCTGGAG 
TCAGGAGGGGCCAGCGTCATCAGCACCCGATGTTCTGAGACCTACNAGACCAAGACGGNCCTNCTCAGCC 
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TCTTNGGNATCCCCCTGGGGT^^ 
TGNTGGGCCTTNCAGGGGCCANNGGGNTTNNCC^ 

Human UNC84B pray sequence - var2 (SEQIDN0:213) 

CGCCGCGAIX^TAGTACCCATACGACGTACCAGTA 

CCACCCAAGCAGTGGTATCAACGCAGAGTGGCCATTATGGCT^ 

TTCACTTTCAGGCCCCTACGATGGCCACGTACCAGGTGGTGGAG 

CCCCGAGTACACCTGCATCTACCGCTTCAGAGTGCATGGGGAGCCCGCCCACTAGCCC^ 
CCCGCTGCC^GCCATCTGGGAGTGGGTGAACAGCACCCOT 

TGACTTCTAGGAGCACAAGAGAGGAGCCTGTGGCCCCATGCAGATGAAAAGGACGGGCAGGGTCTCCT 

GCANCAGGTGGCTCGAGGCX^TTAGCANGCTCCANCAGCTCCCTTCT 
TCTGCTTCCCATCCTGGGAGTGTGTNTATATNTANCATATCATGGGGGACTGG 

Unigene Name : VCY2IP1 Unigene ID ; Hs. 66048 Clone ID : GD_181 
Human VCY2BP1 mRNA sequence - varl (public gi: 22002952) (SEQ ID NO: 214) 

AAGATGGCGGCGGTGGCTGGATCTGGGGCTGCCGCGGCTCCGAGCTCA.CTGCTCCTCGTGGTGGGCAGTO 

AGTTCGGGAGCCCGGGGCTCCTCACCTACGTCCTGGAGGAGCTCGAAAGAGGCATCCGGTCTTG 

CGATCCTGGCGTCTGCAACCTTGATGAAGAGCT 

AGCATTGTGAAAGGCCAGCGGAGCCTGCACC^CCGTGGAGACAACCTGGAC^ 
CATCAGACAAGTCCCTGTATGATGAGCTCCGGAACCTTCTGTTGG 

GTTGGCTGGGCTCTGCCTGGAGGAGACGGGGGAGCTGCTGC'TACAGACAGGGGGCTTCTCGCCTCACCAC 

TTCCTCCAGGTCCTGAAGGACAGAGAGATCCGGGACATCCTGGCCACC^CGCCCCCACCTGTGCAGCCGC 

CC^TACTCACCATCACCTGCCCCACCTTCGGTGACTGGGCTCAGCCGGC^CCCGCTGTGCCTGGCCTT^ 

GGGGGCGCTCCGGCTCCAGCTGCGGCTGAACCCCCCGGCGCAGCTGCCCAACTCTGAGGGCCTGTGCGAA 

TTCCTGGAGTACGTGGCTGAGTCTCTGGAGCCACCGTCCCCCTTCGAGCTGCTGGAGCCCCCGACCTCCG 

GGGGCTTCCTCAGGCTGGGCCGGCCCTGCTGCTACATCTTCCCTGGAGGCCTCGGGGATGCCGCCTTCnT 

CGCCGTC^TGGCTTCACTGTGCTGGTCAACXMTGGCT(^^ 

CGGCACCTGGACCGCGTGGATGCCGTGCTGGTGACCCACCCT^ 

GCCTGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGGTGGCTGCTGGTGGGGGCTCCTGGGACGACAGGCT 
GCGCAGGCTCATCTCCCCCAACCTGGGGGTCGTGTTCTTCAACGCCTGCGAGGCCGCGTCGCGGCTGGCG 
CGCGGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGCATCACGCCTCTGCCACTCA 
GCCGCGGCCCCXSTGCCAGCCAAACCCACCGTGCTCTTC^ 

TGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTCTGTGTGCGCCCTGCTGGTGTGGCAC 
CCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGCACCCCGCCCGCCTGCCTCCTGG 
ACGGCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTC 

GCCGGGGCGAGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTGGCC 
ACCC^CCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCCAGCACGGGCTGAGGCCCCAC 
GC^GACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAGGAAAGACCCC^VAACCGAGTGTCTCCCGGAC 
CCAGCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCGGCA 
CCCAAGCCCCGCAAAGCGCCCAGCACGTCCCACTCTGGCTTCCCGCCGGTGGCAAATGGACCCCGCAGCC 
CGCCCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCTGGT 
GGCCACGCCCAGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGCACTGGAGCTGCCTTTGGCC 
GCCAGCTCAATCCCAAGGCC^CGC^C^CCCTCCC 

GGCTGTCGCTGAGCCC^CTGCGGGGCGGGGAGGCCGGGCCAGACGCCTCACCCACAGTGACCACACCCAC 
GGTGACCACGCCCTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGAGGTGGACGAGTCCCTGTCG 
GTGTCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGCGTG 
GCCCCCGGGCGCGGCGCTCGGCTTCCCCAGACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATTTGA 
GCATCGCAAGGCGGTGCGAATGGCACCGGCACCTGCGTCCCCCGGCAGCTCGAATGACAGCAGTGCCCGG 
TC^CAGGAACGGGCAGGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCGGTCAGCGAGTCCCTGCCCA 
CCCTGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGGCTT 
TGGAGTCCCTCGCCACXSACCCrTTGCCT 

ATCTGCATGGTGGACCCCGAGATGCTGCCCCCCAAGACAGCACGGCAAACGGAGAACGTCAGCCGCACCC 
GGAAGCCCCTGGCCCGCCCCAACTCACGCGCTC^ 

CAAGGGGCTTGCTGGTGGGGACCGTGCCAGCCGACCACTCAGTGCCCGGAGTGAGCCCAGTGAGAAGGGA 
GGCCGGGCACCCCTGTCCAGAAAGTCCTCAACCCCCAAGACTGCCACTCGAGGCCCGTCGGGGTCAGC^ 
GCAGCCGGCCCGGGGTGTCAGCCACCCGACCCAAGTCC^ 

CGGGAGGAGCGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGCGTGCGCGCGCTCTGCTACGTCATCAGT 
GGCCAGGACCAGCGGAAGGAGGAAGGCATGCXXKSCCGTC^ 

GGGACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCGACTCGGTGGCCATGCATACGTGGTACGCAGA 
GACGCACGCCCGGCACCAGGCGCTGGGCATCACGGTGTTGGGCAG 

GACGCCTTCCCGGCCTGCAAGGTGGAGTTCTAGCCCCATCGCCGACACGCCCCCCACTCAGCCCAGCCCG 
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CCTGTCCCTAGATTCAGCCACATCAGAAATAAACTGTGACTAC 

Human VCY2EP1 mRNA sequence- var2 (public gi: 21739762) (SEQ ID NO: 215) 

CCGAAGATGGCGGCGGTGGCT 

GCGAGTTCGGGAGCCCGGGGCTCCTCACCTACGTCCTGGAGC^ 
TGTCGATCCTGGCGTCTGCAACCTTGATGAACAGCT 
TCCAGCATTGTGAAAGGCCAGCGGAGCCTGCACCACCGTGGAGACAAC 
ACCCATCAGACAAGTCCCTGTATGATGAGCTCCGGAACCTTCTGTTGGACCCT^ 

GGTGTTGGCTGGGCCCTGCCTGGAGGAGA03GGGGAGCTGCTGCT 
GACTTCCTCCAGGTCCTGAAGGACAGAGAGATCCXX3GACATCCTGGC(^ 
CGCCCATACTGACGATCACCTGCCCCACCIT 
TCAGGGGGCGCTCCX3GCTCCAGCTGCGGCTGAAC 

GAATTCCTGGAGTACGTGGCTGAGTCTCTGGAGCCACCGTCCCCCTTCGAGCTGCTGGAGCCCCCGACCT 

CCGGGGGCTTCCTCAGGCTGGGCCGGCCCTGCTGCTACATCTTCCCTGGAGGCCTCGGGGATGCCGCCTT 

CTTCGCCGTCAATGGCTTCACTGTGCTGGTCAACGGTGGCTCAAACCCCAAGTCCAGTTTCT 

GTGCGGCACCTGGACCGCGTGGATGCCX5TGCTGGTGACCCACCCT 

ACAGCC^CTGCGGCGCAAAOTGGCGGAGOSCTCCGAGGTGGCTGCTGGTGG 

GCTGCGCAGGCTCATCTCCCCCAACCTGGGGGTCGTC 

GCGCG03GCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGCAT 
TCAGCTCCGGCCCCGTGCCAGCCAAACCCACCGTGCTCT^^ 

GTATGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTCTGTGTGCGCCCTGCTGGTGTGG 

CACCCCGCCGGCCCCGGCGAGAAGGTGGTGCGCX5TGCTGTTCCCCGGTTGCACCCCGCCCGCCTACCTCC 

TGGACGGCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGA 

GGGGCCGGGGCGAGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAA^ 

GCCACCCACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCGAGCACGGGCT 

CACGCAAGACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAAGAAAGACCCCAAACCGAGTGTCTCCCG 

GACCCAGCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCG 

GC^CCCJ^GCCCCGC^^GCGCCC^GC^CGTCCCACTCTGGCTTCCCGCCGGTGGCAAATGGACCCCG^ 

GCCCGCCCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCT 

GGTGGCCA.CGCCCAGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGCACT 

<3CCGCCAGCTCAATCCCAAGGCCACGCACACCCTCCCCTGAGTCCCACCGGAGCCCCGCAGAGGGCAGCG 

AGCGGCTGTCGCTGAGCCCACTGCGGGGCGGGGAGGCCGGGCCAGACGCCTCACCCACAGTGACCACACC 

CACGGTGACCACGCCCTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGAGGTGGACGAGTCCCTG 

TCGGTGTCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGC 

GTGGCCCCCGGGCGCGGCGCTCGGCTTCCCCACACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATT 

TGAGCATCGCAAGGCGGTGCCAATGGCACCGK3CACCT 

CGGTCACAGGAACGGGCAGGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCGGTCAGCGAGTCCCTGC 
CC^CCCTGTCTGACTCGGATCCCGTGCCCCOXSGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGG 
CTTTGGAGTCCCTCGCCACGACCCTTTGCCTGACCCCCTCAAGGTCCCCCCACCACTGCCTGACCCATCC 
AGCATCTGCATGGTGGACCCCGAGATGCTGCCCCCCAAGACAGCACGGCAAACGGAGAACGTCAGCCGCA 
CCCGGAAGCCCCTGGCCCGCCCC^^CTC7tf:GCGCTGCCGCCCCC^U^ 

AACCAAGGGGCTTGCTGGTGGGGACCGTGCCAGC CGACCACTCAGTGCCCGGAGTGAGC CCAGTGAGAAG 
GGAGGCCGGGCACCCCTGTCCAGAAAGTCCTCAACCCCCAAGACTGCCACTCGAGGCCCGTCGGGGTCAG 
CCAGCAGCCGGCCCXSGGGTGTCAGCCACCCCACCCT^TC 

CAGCGGGAGGAGCGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGCGTGCGCGCGCTCTGCTACGTCATC 

AGTGGCCAGGACCAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGGACGCGCTACTGGCCAGCAAGCAGC 

ATTGGGACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCGACTCGGTGGCCATGCATACGTGGTACGC 

AGAGACGCACGCCCGGCACCAGGCGCTGGGCATCACGGTGTTGGGCAGCAACAGCATGGTGTCCATGCAG 

GATGACGCCTTCCCGGCCTGCAAGGTGGAGTTCTAGCCCCATCGCCGACACGCCCCCCACTCAGCCCAGC 

CCGCCTGTCCCTAGATTCAGCCAC^TCAGAAATAAACTGTGACTACACTTGGTAAAAAAAAAAA 

AA 

Human VCY2IP1 mRNA sequence - var3 (public gi: 21104445) (SEQ ID NO: 216) 

CCGAGGTGGCTGCTGGTGGGGGCTCCTGGGACGACAGGCTGCGCAGGCTCATCTCCCCCAACCTGGGGGT 
CGTGTTCTTCAACGCCTGCGAGGCCGCGTCGCGGCTGGCGCGCGGCGAGGATGAGGCGGAGCTGGCGCTG 
AGCCTCCTGGCGCAGCTGGGCATCACGCCTCTGCCACTCAGCCGCGGCCCCGTGCCAGCCAAACCCACCG 
TGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACATGTATGTGCTGCACCCGCCCTCCGCCGGCGCCGA 
GCGCACGCTGGCCTCTGTGTGCGCCCTGCTGGTGTGGCACCCCGCCGGCCCCGGCGAGAAGGTGGTGCGC 
GTGCTGTTCCCCGGTTGCACCCCGCCCGCOTGCCTCCTGGACGGCCTGGTCCGCCTGCAGCACTTGAGGT 
TCCTGCGAGAGCCGGTGGTGACGCCCCAGGACCTGGAGGGGCCGGGGCGAGCCGAGAGCAAAGAGAGCGT 
. GGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTGGCCACCCACCCTAGACCTGGCCAGGAGCGCCCT 
GGGGTGGCCCGCAAGGAGCCAGCACGGGCTGAGGCCCCACGCAAGACTGAGAAAGAAGCCAAGGCCCCCC 
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-GGGAGTTGAAGAAAGACCCCAAACCX^GTGTCTCC 

TTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCGGC^ 

CACTCTGGCTTCCCGCCGGTGGCAAATGGACCCCGCAGCCC^ 

CCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCTG^ 

CCCAGCCGGGGAGGAGAAGGCACTGGAGCTGCCTTTGGCCGCCAGCT 

TC^CCTGAGTCCCACCXaGAGCCCCGCAGAGGGCAGC^ 

AGGCCGGGCCAGACGCCTCACCGACAGTGACCACACCCACGGTGACCACGCCCT 

GGGCTCCCCGGACTCGACCX5AGGTGGACGAGTCCCTGTCGGTGTCCTTTGAGCAGGTGCTGCC 

GCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGCGTGGC^ 

ACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATTTGAGCATCGCAA 

ACCTGCGTCCCCCGGCAGCTCGAATGACAGCAGTGC^ 

GAGGAGACGCGACCCAC^TCGGTGAGCGAGTCCCTG^ 

CCCCCGGTGCXX3CAGACTCAGACGAAGACACAGAGGGCTTTGGAGTCCCTCGCCACGACC 
CCCCCTCAAGGTCCCCC(^CCACTGCCTGACCCATCCAGCATCTGCATGGTGGACCCCGAGATGOT 
CCCAAGACAGCACGGCAAACGGAGAACGTCAGCCGCACCCGGAAGCCCCTGGCCCGCCCCAACTCACGCG 
CTGCCGCCCCCAAAGCCACTCCAGTGGCTGCTGCC^AAACCAAGGGGCT^ 

CCGACCACTCAGTGCCCGGAGTGAGCCCAGTGAGAAGGGAGGCCGGGC^CCCCTGTCCAGAAAGTCCTC^ 
ACCCCCAAGACTGCCACTCGAGGCCCGTCGGGGTCAGCCAGCAGCCGGCCCGGGGTGTCAGCCACCCCAC 
CCAAGTCCCCGGTCTACCTGGACCTGGCCTACCTGCCCAGCGGGAG 
GTTCTTCCAGCGCGTGCGCGCXSCTCTGCTACGTC^^ 

CGGGCCGTCCTGGACGCGCTACTGGCCAGCAAGCAGCATTGGGACCGTGACCTGCAGGTGACCCTGATCC 
CCACTTTCGACTCGGTGGCCATGCATACGTGGTACGCAGAGACGCACGCCCGGCACCAGGCGCTGGGCAT ' 
GACGGTGTTGGGCAGCAkCAGCATGGTGTCCATC 

TAGCCCCATCGCCGACACGCCCCCC^CTCAGCCCAGCCCGCCTGTCCCTAGATTCAGCCACATC^Gi^T 
AAACTGTGACTTCCAAAAAAAAAAA 

Human VCY2IP1 mRNA sequence - var4 (public gi: 14250679) (seq id no: 217) 

GGGACGAGGCCGCCTTCTTCGCCGTCAATGGCTTCACTGTGCTGGTCAACGGTGGCTC^ 
CAGTTTCTGGAAGCTGGTGCGGCACCTGGACCGCGTGGATGCCGTGCTGGTGACCCACCCTGGCGCCGAC 
AGCCTCCCCGGCCTCAACAGCCTGCTGCGGCGCAAACTGGCGGAGC 
GCTCCTGGGACGACAGGCTGCGCAGGCTCATCTCCCCC^ 

GGCCGCGTCGCGGCTGGCGCGCGGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGC 
ATCACGCCTCTGCCACTCAGCCGCGGCCCCGTGCCAGCCAAACCCACCGTGCTCTTCGAGAAGATGGGCG 
TGGGCCGGCTGGACATGTATGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTCTGTGTG 
GGCCCTGCTGGTGTGGCACCCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGCACC 
CCGCCCGCCTGCCTCCTGGACGGCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGA 
CGCCCCAGGACCTGGAGGGGCCGGGGCGAGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAA 
GAGAGAGGGCCTCCTGGCCACCCACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCCA 
GCACGGGCraAGGCCCC^CGC^GACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAAGAAAGACCCCA 
AACCGAGTGTCTCCCGGACCCAGCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAA 
GACGAATGCCCAGGCGGCACCCAAGCCCCGCAAAGCGCCCAGCACGTCCCACTCTGGCTTCCCGCCGGTG 
GCAAATGGACCCCGCAGCCCGCCCAGCCTCCGATGTGGAGAAGCCAGC^ 

CTCCGGCCTCCCAGCTGGTGGCCACGCCCAGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGC 

ACTGGAGCTGCCTTTGGCCGCCAGCTCAATCCC^GGCCACGCACACCCTCCCCTGAGTCCCACCGGAGC 

CCCGCAGAGGGCAGCGAGCGGCTGTCGCTGAGCCCACTGCGGGGCGGGGAGGCCGGGCCAGACGCCTCAC 

CCACAGTGACCACACCCACGGTGACCACGCCCTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGA 

GGTGGACGAGTCCCTGTCGGTGTCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGG 

CTGAGCCTCCCGCTGCGTGGCCCCCGGGCGCGGCGOTCGGCTTCCCCACACGATGTGGACCTGTGCCTGG 

TGTCACCCTGTGAATTTGAGCATCGCAAGGCGGTGCCAATGGCACCGGCACCTGCGTC 

GAATGACAGCAGTGCCCGGTCACAGGAACGGGCAGGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCG 

GTC^GCGAGTCCCTGCCCACCCTGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGCAGACTCAG 

ACGAAGACACAGAGGGCTTTGGAGTCCCTCGCCACGACCCTTTGCCTGACCCCCTCAAGGTCCCCCCACC 

ACTGCCTGACCCATCCAGCATCTGCATGGTGGACCCCGAGATGCTGCCCCCCAAGACAGCA 

GAGAACGTCAGCCXK!IACCCGGAAGCCCCTGGCCCGCCCCAACrC7VCGCGCTGCCGCCCCCAAAGCCACTC 

(ZAGTGGCTGCTGCCAAAACCAAGGGGCTTGCTGGTGGGGACCGTGCCAGCCGACCACTCAGT^ 

TGAGCCCAGTGAGAAGGGAGGCCGGGCACCCCTGTCCAGAAAGTCCTCAACCCCCAAGACTGCCACTCGA 

GGCCCGTCGGGGTCAGCCAGCAGCCGGCCCGGGGTGTCAGCCACCCCACCCAAGTCCCCGGTCTACCTGG 

ACCTGGCCTACCTGCCCAGCGGGAGCAGCGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGCGTGCGCGC 

GCTCTGCTACGTCATCAGTGGCCAGGACCAGCGGAAGGAGG 

CTGGCCAGCAAGCAGCATTGGGACCGTGACCTGC^GGTGACCCTGATCCCCACTTTCGACTCGGTGGC^ 
TGCATACGTGGTACGCAGAGACGCACGCCCGGCACCAGGCGCTGGGCATCACGGTGTTC 
CATGGTGTCCATGCAGGATGACGCCTTCCCGGCTTGCAAGGTGGAGTTCTAGCCCCATCGCCGACACGCC 
CCCCACTGAGCCCAGCCCGCCTGTCCCTAGATTCAGCCACATCAGAAATAAACTGTGACTACACTTGAAA 
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AAAAAAAAAAAAAAAAAAAAAAAA 

Human VCY2IP1 mRNA sequence - var5 (public gi: 13938254) (SEQIDNO:2l8) 

GACACCGACAGGGACTCGTCCACCTCGGTGTCCTTTGAGCAGGTGCTC 

AGGCTOGGOTGAGCCTCCCGCrGCXTGGCCCCCGGGCGCGGC 

GTGCCTGGTGTCACCCTOTGAATTTGA 

GGCAGCTCGAATGACAGCAGTGCCCGGTCACAGG^^ 

CCAC^TCX3GTCAGCGAGTCCCTGCCCACCCT^ 

AGACTCAGACGAAGACACAGAGGGCTTTGGAGTCCCTCGCCACGACCCTTTGCCTGACCCC 

CCCCCACC^CTGCCTGACCCATCCAGCATCTGCATGGTGGACCCCGAGATGCTGCCCCCC^AGACAGCAC 

GGCAAACGGAGAACGTCAGCCGCACCCGGAAGCCCCT^ 

AGCCACTCGAGTGGCTGCTGCCAAAACCAAG 

GCCCGGAGTGAGCCCAGTGAGAAGGGAGGCCGGGCACCCCTC 

CCACTCGAGGCCCGTCGGGGTCAGCCAGCAGCCGGCCCGGGGTGTCAGCCACCCCACCCAAGTCCCCGGT 

CTACCTGGACCTGGCCTACCTGCCCAGCGGGAGGAGCGCCCACCTGGTGGATGAGGAGTTCTT 

GTGCGCGCGCTCTGCTACGTCATCAGTGGCCAGGACCAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGG 

ACGCGCTACTGGCCAGCAAGCAGCATTGGGACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCGACT 

GGTGGCCATGCATACGTGGTACGCAGAGACGCACGCCCGGCACC^ 

AGCAACAGCATGGTGTCCATGCAGGATGACGCCTTCCCGGCCrGCAAGGT 

GACACGCCCCCCACTCAGCCCAGCCCGCCTGTCCCTAGATTCAGCCAC^ 

ACTTAAAAAAAAAAAAAAAAAAAAAA 

Human VCY2IP1 niRN A sequence - var6 (public gi: 14042428) (seq id no: 219) 

AAGATGGCGGCGGTGGCTGGATCTGGGGCTGCCGCGGOTCCGAGCTCACTGCTCCTCGTGGTGGGCAGCG 

AGTTCGGGAGCCCGGGGCTCCTCACCTACGTCCTGGAGGAGCTCGAAAGAGGCATCCGGTCTTGGGATGT 

CGATCCTGGCGTCTGCAACCTTGATGAACAGCTCAAGGTCTTTGTGTCCCGACACTCTGC^ 

AGCATTGTGAAAGGCCAGCGGAGCCTGCACCACCGTGGAGACAACCTGGAGACCCTGGTCCTCCTGAACC 

CATCAGACAAGTCCCTGTATGATGAGCTCCGGAACCTTCTGTTGGACCCTGCCTCTCACAAGCTACTGGT . 

GTTGGCTGGGCTCTGCCTGGAGGAGACGGGGGAGCTGCTGCTACAGACAGGGGGCTTCTCGCCTCACCAC 

TTCCTCCAGGTCCTGAAGGACAGAGAGATCCGGGACATCCTGGCCACCACGCCCCCACCTGTGCAGCCGC 

CCATACTCACCATCACCTGCCCCACCTTCGGTGACTGGGCT 

GGGGGCGCTCCGGCTCCAGCTGCGGCTGAACCCCCCGGCGCAGCTG 

TTCCTGGAGTACGTGGCTGAGTCTCTGGAGCCACCGTCCCCCTTCGAGCTGCTGGAGCCCCCGACCTCCG 
GGGGCTTCCTCAGGCTGGGCCGGCCCTGCTGCTACATCTTCCCTGGAGGCCTCGGGGATGCCGCCTTCTT 
CGCCGTGAATGGCTTCACTGTGCTGGTCAACGGTGGCTCJ^ 

CGGCACCTGGACCGCGTGGATGCCGTGCTGGTGACCCACCCTGGCGCCGACAGCCTCCCCGGCCTCAACA 
GCCTGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGGTGGCTGCTGGTGGGGGCTCCTGGGACGACAGGCT 
GCGCAGGCTCATCTCCCCCAACCTGGGGGTCGTGTTCTTCAACGCCTGCGAGGCCGCGTCGCGGCTGGCG 
CGCX^CGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGCATCACGCCTCTGCCACTCA 
GCCGCGGCCCCGTGCCAGCCAAACCCACCGTGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACATGTA 
TGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTCTGTGTGCGCCCTGCTGGTGTGGCAC 
CCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGCACCCCGCCCGCCTGCCTCCTGG 
ACGGCCTGGTCCGCCTGCAGCAOTTGAGGTTCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGAGGG 
GCCGGGGCGAGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTGGCC 
ACCCACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCCAGCACGGGCTGAGGCCCCAC 
GCAAGACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAGGAAAGACCCCAAACCGAGTGTCTCCCGGAC 
CCAGCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCGGCA 
CCCAAGCCCCGCAAAGCGCCCAGCACGTCCCACTCTGGCTTCCCGCCGGTGGCAAATGGACCCCGCAGCC 
CGCCCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCTGGT 
GGCCACGCCCAGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGCACTGGAGCTGCCTTTGGCC 
GCCAGCTCAATCCCAAGGCCACGCACACCCTCCCCTGAGTCCCACCGGAGCCCCGCAGAGGGCAGCGAGC 
GGCTGTCGCTGAGCCCACTGCGGGGCGGGGAGGCCGGGCCAGACGCCTCACCC^CAGTGACCACACCCAC 
GGTGACCACGCCCTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGAGGTGGACGAGTCCCTGTCG 
GTGTCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGCGTG 
GCCCCCGGGCGCGGCGCTCGGCTTCCCCACACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATTTGA 
GCATCGCAAGGCGGTGCCAATGGGACCGGCACCrGCGT^ 

TC^CAGGAACGGGC^C3GTGGGCTGGGGGCCGAGGAGACGC(^CCCACATCGGTCAGCGAGTCCCTGCCCA 
CCCTGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGGCTT 
TGGAGTCCCTCGCC^CGACCCTTTGCCTGACCCCCTC^GGTCCCCCCACCACTGCCTGACCCATCCAGC 
ATCTGCATGGTGGACCCCGAGATGCTGCCCCCCCAAGACAGCACGGCAAACGGAGAACGTCAGCCGCACC 
CGGAAGCCCCTGGCCCGCCCC^CTCACGCGCTGCCGCCCCC^^GCC^CTCCAGTGGCTGCTGCCAAAA 
CCAAGGGGCTTGCTGGTGGGGACCGTGCCAGCCGACCACTCAGTGCCCGGAGTGAGCCCAGTGAGAAGGG 
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JVGGCCX3GGCACCCCTGTCCAGAAAGTCCT 

AGCAGCCGGCCCGGGGTGTCAGCCACCCCACCCAAGTCCCCGGTCTACC^ 
GCGGGAGCAGCXXZCCACCTGGTGGATGAGGAGTTC^ 

TGGCCAGGACCAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGGACGCGCTACT 

TGGGACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCGACTCGGTGGCCATGCATACGTGGTACG 

AGACGGACGCCCGGCACCAGGCGCTGGGCATCACGGTC 

TGACGCCTTCCCGGCCTGCAAGGTGGAGTTCTAGCCCCATCGCaSACACGeCCCCCACTCAGC 
GCCTGTCCCTAGATTCAGCCAGATCAGAAATAAACTGTGACTACACTTG 

Human VCY2IP1 raRNA sequence- var7 (public gi: 13623504) (SEQ ID NO: 220) 

GGCACGAGGCCCTOTATGATGAGCTCCGGAACOTTCTGTTGGA 

GGCTGGGCCCTGCCTGGAGGAGACGGGGGAGCTGCTGCTACAGACAGGGGGCTTCTCG^ 
CTCCAGGTCCTTGAAGGACAGAGAGATCCGGGACATCCTGGCCACGACGCCCCC^ 
TACTCAC(^TCACCTGCCCCACCTTCGGTGACT 
GGCGCTCCGGCTCCAGCTGCGGCTGAACCCCCC^ 

CTGGAGTACGTGGCTGAGTCTCTGGAGCCACCGTCCCCCTTCGAGCTGCTGGAGCCCCCGACCTCCGGGG 
GCTTCCTCAGGCTGGGCCGGCCCTGCTGCTACATCTTCCCTG 
CGTCAATGGCTTCACTGTGCTGGTCAACGGTGGCTCA^ 
CACCTGGACCGCGTGGATGCCGTGSCTGGTGACCCACCCTGGCGCCG 

TGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGGTGGCTGCTGGTGGGGGCTCCTGGGACGACAGGCTGCG 
C^GGCTC^TCTCCCCCAACCTGGGGGTCGTGTTCT^ 

GGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGCATCACGCCTCTGCG^ 

GCGGCCCCGTGCCAGCCAAACCCACCGTGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACATGTATGT 

GCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTGTGTGTGCGCCCTGCTGGTGTGGCACCCC ' 

GCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGGACCCCGCCCGCCTGCCTCCTGGACG 

GCCTGGTCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGAGGGGCC 

GGGG CGAG CCGAGAG CAAAGAGAGCGTGGGCTCCCGGGACAGCT CGAAGAGAGAGGGCCTCCTGGC CACC 

CACCCTAGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCCAGCACGGGCTGAGGCCCCACGCA 

AGACTGAGAAAGAAGCCAAGACCCCCCGGGAGTTGAAGAAAGACCCCAAACCGAGTGTCTCCCGGACCCA 

GCCGCGGGAGGTGCGCCGGGCAGCCTCTTCTGTGCCC^CCTCTVAGAAGACGAATGCCCAGGCGGCACCC 

AAGCCCCGCAAAGCGCCCAGCACGTCCCACTCTGGCTTCCCGCCGGTGGCAAATGGACCCCGCAGCCCGC 

CCAGCCTCCGATGTGGAGAAGCCAGCCCCCCCAGTGCGGCCTGCGGCTCTCCGGCCTCCCAGCTGGTGGC 

CACGCCCAGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGCACTGGAGCTGCCTTTGGCCGCC 

AGCTCAATCCCAAGGCCACGCACACCCTCCCCTGAGTCCCACCGGAGCCCCGCAGAGGGCAGCGAGCGGC 

TGTCGCTGAGCCCACTGCGGGGCGGGGAGGCCGGC^CAGACGCCTCACC 

GACCACGCCCTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGAGGTGGACGAGTCCCTGTCGGTG 
TCCTTTGAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGCGTO 
CCCGGGCGCGGCGCTCGGCTT.CCCCACACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATTTGAGCA 
TCGCAAGGCGGTGCG^TGGCACCGGCACCTGCGTC 

C^GGAACGGGC^GGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCGGTGAGCGAGTCCCTGCCCACCC 

TGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGG 

AGTCCCTCGC<^CGACCCTTTGCCTGACCCC^ 

TGCATGGTGGACCCCGAGATGCTGCCCCCCAAGACAGCACGGCAAACGGAGAACGTCAGCCGCACCCGGA 
AGCCCCTGGCCCGCCCC^CTC^CGCGCTGCCGCCCCa^ 

GGGGCTTGCTGGTGGGGACCCaTGCCAGCCGACCACTCAGTGCCCGGAGTGAGCCCAGTGAGAAGGGAGGC 
CGGGCACCCCTGTCC^GAAAGTCCTCAACCCCCAAGACT 

GCCGGCCCGGGGTGTCAGCCACCCCACCCAAGTCCCCGGTCTACCTGGACCTGGCCTACCTGCCCAGCGG 
GAGCAGCGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGCGTGC^ 

CAGGACCAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGGACGCGCTACTGGCCAGCAAGCAGCATTGGG 
ACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCGACTCGGTGGCCATGCATACGTGGTACGCAGAGAC 
GCACGCCCGGGACCAGGCGCTGGGCATGACGGTGTTGGG 
GCCTTCCCGGCCTGC^AGGTGGAGTTCTAGCCC^ 

GTCCCTAGATTCAGCCACATC^GAAATAAACTGTGACTACACTTGAAAAAAAAAAAAAAAAAA 

Hiuiian VCY2IP1 mRNA sequence - var8 (public gi: 10434393) (SEQ ID NO: 221) 

GAACCCCAAGTCCAGTTTCTGGAAGCTGGTGCGGCACCTGGACCGCGTGGATGCCGTGCTGGTGACCCAC 
CCTGGCGCCGACAGCCTGCCCGGCCTCAACAGCCTGCTGCGGCGCAAACTGGCGGAGCGCTCCGAGGTGG 
CTGCTGGTGGGGGCTCCTGGGACGAGAGGCTGCGGAGGCT 

CAACGCCTGCGAGGCCGCGTCGCGGCTGGCGCGCGGCGAGGATGAGGCGGAGCTGGCGCTGAGCCTCCTG 
GCGCAGCTGGGCATC^CGCCrrCTGCCACT^ 

AGAAGATGGGCGTGGGCCGGCTGGACATGTATGTGCTGCACCCGCCCTCCGCCGGCGCCGAGCGCACGCT 
GGCCTCTGTGTGCGCCCTGCTGGTGTGGCACCCCGCCGGCCCCGGCGAGAAGGTGGTGCGCGTGCTGTTC 
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CCOX;TTGCACCCCX3CCTOCCTGCCTCCTGGACGGCCTGGTCCGCCTGCAGCACT 

AGCCCGTGGTGACGCCCCAGGACCTGGAGGGGCCGGGGCGAGCCGAGAGCAAAGAGAG^ 

GGACAGOTCGAAGAGAGAGGGCCTCCTGGCCACCCACCCTAGACCTC 

CGCAAGGAGCCAGCACGGGCTGAGGCCCCACGCAAGACTGAGAAAGAAGCCAAGACCC 

AGAAAGACCCCAAACCGAGTGTCTCCCGGACCCAGCCGCGGGAGGTGCGCCGGGCA 

caacctcaagaagacgaatgcccaggcggcacccaagccccgcaaagcgcccagca 
ttcccgccggtggcaaatggaccccgcagcccgcccagcctccgatgtggaga 

CAGCCTGCGGCTCTCCGGCCTCCCAGCTGGTGGCCACGCCCAGCCTGGAGCTGGGGCCGATCCCAGCCGG 
GGAGGAGAAGGCACTGGAGCTGCCTTTGGCCGCCA^ 

TCCCACCGGAGCCCCGCAGAGGGCAGCGAGCGGCTGTCGCTGAGCCCACTO 

C^GACGCCT^CCCAC^GTGACCACACCCACGGTGACCACGCCCTCACTACCCGCAGAG^ 

GCACTCGACCGAGGTGGACGAGTCCCTGTCGGTGTCCTTTGAGC^ 

AGTGAGGCTGGGCTGAGCCTCCCGCTGCGTGGCCCCCGGGCGCGGCGCTCGGCTTCCCCACACGATGTGG 
ACCTGTGCCTGGTGTCACCCTGTGAATTTGAGCATCGCA^ 

CCCCGGGAGCTCGAATGACAGCAGTGCCCGGTCACAGGAACGGGCAGGTGGGCTGGGGGCCGAGGAGACG 

CCACC(^CATCGGTCAGCGAGTCCCTGCCCACCCTGTCTGACTCGGATCCCGTGCCCCTGGCCCCCGGTG 

CGGCAGACTCAGACGAAGACACAGAGGGCTTTGGAGTCCCTCGCCACGACCCTTTGCCTGACCCCCTCAA 

GGTCCCCCCACCACTGCCTGACCCATCCAGCATCTGCATGGTGGACCCCGAGATGCTGCCCCCCAAGACA 

GCACGGCAAACGGAGAACGTCAGCCGCACCCGGAAGCCCCTGGCCCGCCCCAACTCACGCGCTGCCGCCC 

CCAAAGCCACTCCAGTGGCTGCTGCCAAAACCAAGGGGCTTGCT^^ 

CAGTGCCCGGAGTGAGCCCAGTGAGAAGGGAGGCCGGGCACCCCTGTCCAGT^ 

ACTGCCACTCGAGGCCCGTCGGGGTCAGC(^G(^GCCGGCCCGGGGTGTC^GCCACCCC^CCC^AGTCCC 
CGGTCTACCTGGACCTGGCCTACCTGCCCAGCGGGAGCAGCGCCCACCTGGTGGATGAGGAGCTCTTCCA 
GCGCGTGCGCGCGCTCTGCTACGTCATCAGTGGCCAGGACCAGCGCAAGGAGGAAGGCATGCGGGC^ 
CTGGACGCGCTACTGGCCAGCAAGCAGCATTGGGACCGTGACCTGCAGGTGACCCTGATCCCCACTTTCG 
ACTCGGTGGCCATGCATACGTGGTACGCAGAGACGCACGCCCGGCACCAGGCGCTGGGCATCACGGTGTT 
" GGGC^GCAACAGCATGGTGTCCATGCAGGATGACGCCTTCCCGGCTTGCAAGGTGGAGTTCTAGCCCCAT 
CGCCGACACGCCCCCCACTO^GCCCAGCCCGCCTGTCCCTAGATTCAGCCACATCAGAAATAAACTGTGA 
CTAC 

Hxunan VCY2IP1 mRNA sequence - var9 (public gi: 7022843) (seq id no: 222) 

CATCTCCCCCAACCTGGGGGTCGTGTTCTTCAACGCCTGCGAGGCCGCGTCGCGGCTGGCGCGCGGCGAG 

GATGAGGCGGAGCTGGCGCTGAGCCTCCTGGCGCAGCTGGGCATCACGCCTCTGCCACTCAGCCGCGGCC 

CCGTGCCAGCCAAACCCACCGTGCTCTTCGAGAAGATGGGCGTGGGCCGGCTGGACATGTATGTGCTGCA 

CCCGCCCTCCGCCGGCGCCGAGCGCACGCTGGCCTCTGTGTGCGCCCTGCTGGTGTGGCACCCGGCCGGC 

CCCGGCGAGAAGGTGGTGCGCGTGCTGTTCCCCGGTTGCACCCCGCCCGCCTGCCTCCTGGACGGCCTGG 

TCCGCCTGCAGCACTTGAGGTTCCTGCGAGAGCCCGTGGTGACGCCCCAGGACCTGGAGGGGCCGGGGCG 

AGCCGAGAGCAAAGAGAGCGTGGGCTCCCGGGACAGCTCGAAGAGAGAGGGCCTCCTGGCCACCCACCCT 

AGACCTGGCCAGGAGCGCCCTGGGGTGGCCCGCAAGGAGCCAGCACGGGCTGAGGCCCCACGCAAGACTG 

AGAAAGAAGCCAAGACCCCCCGGGAGTTGAAGAAAGACCCCAAACCGAGTGTCTCCCGGACCCAGCCGCG 

GGAGGTGCGCCGGGCAGCCTCTTCTGTGCCCAACCTCAAGAAGACGAATGCCCAGGCGGCACCCAAGCCC 

CGCAAAGCGCCCAGCACGTCCCACTCTGGCTTCCCGCCGGTGisCAAATGGACCCCGCAGCCCGCCCAGCC 

TCCGATGTGGAGAAGCCAGCCCCCCCAGTGCAGCCTGCGGCTCTCCGGCCTCCCAGCTGGTGGCCACGCC 

CAGCCTGGAGCTGGGGCCGATCCCAGCCGGGGAGGAGAAGGCACTGGAGCTGCCTTTGGCCGCCAGCTCA 

ATCCCAAGGCCACGCACACCCTCCCCTGAGTCCCACCGGAGCCCCGCAGAGGGCAGCGAGCGGCTGTCGC 

TGAGCCCACTGCGGGGCGGGGAGGCCGGGCCy\GACGCCTCACCCACAGTGACCACACCCACGGTGACCAC 

GCCCTCACTACCCGCAGAGGTGGGCTCCCCGCACTCGACCGAGGTGGACGAGTCCCTGTCGGTGTCCTTT 

GAGCAGGTGCTGCCGCCATCCGCCCCCACCAGTGAGGCTGGGCTGAGCCTCCCGCTGCGTGGCCCCCGGG 

CGCGGCGCTCGGCTTCCCCACACGATGTGGACCTGTGCCTGGTGTCACCCTGTGAATTTGAGCATCGCAA 

GGCGGTGCCAATGGCACCGGCACCTGCGTCCCCCGGCAGCTCGAATGACAGCAGTGCCCGGTCACAGGAA 

CGGGCAGGTGGGCTGGGGGCCGAGGAGACGCCACCCACATCGGTCAGCGAGTCCCTGCCCACCCTGTCTG 

AGTCGGATCCCGTGCCCCTGGCCCCCGGTGCGGCAGACTCAGACGAAGACACAGAGGGCTTTGGAGT 

TCGCCACGACCCTTTGCCTGACCCCCTCAAGGTCCCCCCACCACTGCCTGACCCATCCAGCATCTGCATG 

GTGGACCCCGAGATGCTGCCCCCCAAGACAGCACGGCAAACGGAGAACGTCAGCCGCACCCGGAAGCCCC 

TGGCCCGCCCCAACTCACGCGCTGCCGCCCCCAAAGCCACTCCAGTGGCTGCTGCCAAAACCAAGGGGCT 

TGCTGGTGGGGACCGTGCCAGCCGACCACTCAGTGCCCGGAGTGAGCCCAGTGAGAAGGGAGGCCGGGCA 

CCCCTGTCCAGAAAGTCCTCAACCCCCAAGACTGCCACTCGAGGCCCGTCGGGGTCAGCCAGCAGCCGGC 

CCGGGGTGTC^GCCACCCCACCCAAGTCCCGGGTCTACCTGGACCTGGCCTACC^^ 

CGCCCACCTGGTGGATGAGGAGTTCTTCCAGCGCGTGCGCGCGCTCTGCTACGTCATCAGTGGCCAGGAC 
CAGCGCAAGGAGGAAGGCATGCGGGCCGTCCTGGACGCGCTACTGGCCAGCAAGCAGCATTGGGACCGTG 
ACCTGCAGGTGACCCTGATCCCCACTTTCGACTCGGTGGCCATGCATACGTGGTACGCAGAGACGCACGC 
CCGGCACCAGGCGCTGGGCATCACGGTGTTGGGCAGCAACGGCATGGTGTCCATGCAGGATGACGCCTTC 
CCGGCCTGCAAGGTGGAGTTCTAGCCCCATCGCCGACACGCCCCCCACTCAGCCCAGCCCGCCTGTCCCT 
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AGATT(^GCCACATCAGAAATAAACTGTG^CTACACTTG 

Human VCY2EP1 Protein sequence - varl (public gi: 22002953) (seq id no: 315) 

MAAVAGSGAAAAPSSLLLVVGSEFGSPGLLTYVLEEl^RGIRSWDTO 

XVKGQRS LHHRGDNIxETLVTjIiNPSDKSIjYDEIiRNT J iT iDPASHKLLVLAGLCLEETGELLLQTGGFS PHHF 
LQVLKDRE I RD I LATTP P P VQP P I LT I TCPT FGD W AQPAP A\n?GLQGALRLQLRLNP P AQLPNS EGLCE F 
LEWAESLEPPSPFELLEPPTSGGFLRIjGRPCCYIFPGGLGDAAFFAWGFTVLW 
HLDRVDAVLVTHPGADSLPGIiNSLJ^RKLA 

GEDEAELALSLIiAQI/5ITPLPLSRGPVPAXPTVLFEKMGVGRLDMyVIiHPP 

AGPGEKVVRVLFPGCTPPACLLDGLVRLQHLRFLRE PWTPQDLEGPGRAES KES VGSRDSS KRE GLLAT 
HPRPGQERPGVARKEPARAEAPRKTEKEAKTPRELRKDPKPSVSRTQPREV^^ 

"KPRKAPSTSHSGFPPVANGPRS PPSLRCGEAS PPSAACGSPASQLVATPSLELGP I PAGEEKALELPLAA 

SSIPRPRTPSPESHRSPAEGSEI^SLSPIiRGGEAGPDASPTVTTPTVTTPSLPAEVGSPHSTEVDES 

SFEQVLPPSAPTSEAGLSLPLRGPRARRSASPHDVDLCLVSPCEFEHRKAVPMA^ 

QERAGGLGAEETPPTSVSESLPTLSDSDPVPLAPGAADSDEDTEGFGVPRHDPLPDPLKVPPPLPDPSSI 
CMVDPEMLPPKTARQTENVSRTRKPLARPNSRAAAPKATPVAAAKTKG 
RAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPVYLDLAYLPSGSSAHLVDEEFFQR^ 
QDQRKEEGMRAVLDALLAS KQHWDRDLQVTLI PT FDSVAMHTW YAETHARHQALG I TVLGSNSMVSMQDD 
AFPACKVEF 

Human VCY2EP1 Protein sequence - var2 (public gi: 21739763) (seq id no: 316) 

PKMAAVAGSGAAAAPSSLLLWGSEFGSPGLLTYVLEELERGIRSWDVDPGVCOTjDEQLI^ 
SSIVXGQRSLHHRGDNLETLVLLNPSDK5LYDELRNLL 

HFLQVLKDRE I RDI LATTPPPVQPP I LTI TCPTFGDWAQLAPAVPGLQGALRLQLRLNPPAQLPNSEGLC 

EFLEWAESLEPPSPFELLEPPTSGGFLRLGRPCCYIFPGGLGDAAFFAWGFTVLTOGGSNPKSSFWKL 

VRHLDRVDAVLVTHPGADSLPGLNSLLRRKLAERSEVAAGGGSTO 

ARGEDEAELAIiSLLAQMITPLPLSRGPVPAKPTVLFEKMGVGRLDMYVI^ 

HPAGPGEKVAmVLFPGCTPPAYLLDGLVRLQHLRFIiREPVVTPQDLEGPG 

ATHPRPGQERPGVARKEPARAEAPRKTEKEAKTPRELKKDPKPSVSRTQPREVRRAASS 

APKPRKAPSTSHSGFPPVANGPRSPPSIiRCGEASPPSAACGSPASQLVATPSIiELGPIPAGEEKALELPL 

AASS I PRPRTPSPESHRSPAEGSERLSLS PLRGGEAGPDASPTVTTPTVTTPSLPAEVGS PHSTEVDESL 

SVSFEQVLPPSAPTSEAGLSLPLRGPRARRSASPHDVDLCLVSPCEFEHRKAVP^PAPASPG 

RSQERAGGLGAEETPPTSVSESLPTLSDSDPVPLAPGAADSDEDTEGFGVPRHDPLPDPLKVPPPLPDPS 

SICMVDPEMLPPKTARQTENVSRTRKPIiARPNSRAAAPKATPVAAAKTKGLAGGD 

GGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPVYLDIAYLP 

SGQDQRKEEGMRAVLDALLAS KQHWDRDLQVTLI PT FDSVAMHTW YAETHARHQALG I TVLGSNSMVSMQ 
DDAFPACKVE F 

Hitman VCY2EP1 Protein sequence - var3 (public gi: 21104445) (seq id no: 317) 

MGVGRLDMYVLHPPS AGAERTLASVCALLVWHPAGPGE KWRVLFPGCTP PACLLDGLVRLQHLRFLREP 
VVTPQDLEGPGRAESKESVGSRDSSKREGLLATHPRPGQERPGVARKEPARARAPRXTEK^ 
DPKPSVSRTQPREVRRAASSVPNLKKTNAQAAPKPRKAPSTSHSGFPPVANGPRSPPSLRCGEASPPSAA 
CGSPASQLVATPSLELGPIPAGEEKALELPLAASSIPRPRTPSPESHRSPAEGSERLSLSPLRGGEAGPD 
ASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSVSFEQVLPPSAPTSEAGLSLPLRGPRARRSJ\SPHDVDL 
CLVSPCEFEHRKAVPMAPAPASPGSSNDSSARSQERAGGLGAEETPPTSVSESLPTLSDSDPVPLAPGAA 
DSDEDTEGFGVPRHDPLPDPLKVPPPLPDPS S IOWDPEMLPPKTARQTENVSRTRKPLARPNSRAAAPK 
ATPVAAAKTKGLAGGDRASRPLSARS E PS E KGGRAPLSRKS STPKTATRGPSGSAS SRPGVSATPPKS PV 
YLDLAYLPSGS SAHLVDEEFFQRVRALCYVI S GQDQRKEEGMRAVLDALLASKQHWDRDLQVTLI PTFDS 
VAMHTW YAETHARHQALGI TVLGSNSMVSMQDDAFPACKVE F 

Human VCY2IP1 Protein sequence - var4 (public gi: 14250680) (seq id no: 318) 

mgvgrldmyvlhppsagaertlasvgallvwhpagpgekvvrvlfpgctppa 

wtpqdlegpgraeskesvgsrdsski^gli^thprpgqerpgvarkeparaeaprktekeaktpi^ 

dpkpsvsrtqprevrraassvpnlkktnaqaapkprkapstshsgfppvangprsppslrcgeasppsaa 

cgspasqlvatpslelgpipageekalelplaassiprprtpspeshrspaegserlslsplrggeagpd 

asptvttptvttpslpaevgsphstevdeslsvsfeqvlppsaptseaglslplrgprarrsasphdvdl 

clvspcefehrkavpmapapaspgssndssarsqeragglgaeetpptsvseslptlsdsdpvplapgaa 

dsdedtegfgvprhdplpdplkvppplpdpssi<>ivdpemlppktarqteotsrtrki>i^ 

atpvaaaktkglaggdrasrplsarsepsekggraplsrksstpktatrgpsgsassrpgvsatppkspv 

yi^laylpsgssahlvdeeffqrvralcyvtsgqdqrkeegmra^ 

vamhtwyaetharhqalgitvlgsnsmvsmqddafpackvef 
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Human VCY2IP1 Protein sequence - var5 (public gi: 13938255) (seq id no: 319) 

DTDRDSSTSVSFEQVIjPPSAPTSEAGLSLPLRGPRARRSASPHDVDLC^ 

GSSNDSSARSQERAGGLGAEETPPTSVSESLPTLSDSDPVPIjAPGAADSDEDTEG 

PPPLPDPSSICMVDPEI^PPKTARQTENVSRTRKPLARPN^ 

ARSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPP 

VRAIjCYVISGQDQRKEEGMRAVLDALLASK^ 

SNSMVSMQDDAFPACKVEF 

Human VCY2IP1 Prcrtein sequence- var6 (public gi: 14042429) (seq id no: 320) 

maavagsgaaaapsslllwgsefgspglltyviieel^ 

ivkgqrslhhrgdnletlvllnpsdkslydelrnllldpash^ 

lqvlkdreirdilattpppvqppiltitcptfgdwaqpapavpgix2(^ 

leyvaesleppspfellepptsggfixriig^ccyifpgglg^ 

HLDRVDAVLiVTH PGADS LPGLNS LLRRKLAERS CTAAGGGS WDDRLRRL I S PNLGWFFNACEAASRIiAR 
GEDEAELALSLIiAQLiGITPLPLSRGPVPAKPTVLFEKMGVGRI^^ 
AGPGEKVVRVLFPGCTPPACLLDGLVRLQHIjRFIjRE 
HPRPGQERPGVARKEPARAEAPRIO'EKEAKTPREIiRKDPCTSVSRTO 

KPRKAPSTSHSGFPPVANGPRSPPSLRCGEASPPSAACGSPASQLVATPSIiELGPIPAGEEKAIiELPIiAA 
SSIPRPRTPSPESHRSPAEGSERIjSIiSPLRGGEAGPDASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSV 
S FEQ VLP PS APTS EAGLS LPLRGPRARRS AS PHDVDLCLVS PCE FEHRKAVPMAP APAS PGS SNDS S ARS 
QERAGGLGAEETPPTSVSESLPTLSDSDPVPLAPGAADSDEDTEGFGVPRHDPLPDPLKVPPPLPDPSSI 
O^PEMLPPQDSTANGERQPHPEAPGPPQLTRCRPQSHSSGCCQNQGACWWGPCQPTT^ 

Human VCY2IP1 Protein sequence - var7 (public gi: 13623505) (seq id no: 321) 

MGVGRLDMYVLHPPSAGAERTLACVCALLVWHPAGP 

VVTPQDIiEGPGRAESKESVGSRDSSKREGLLATHPRPGQERPGVARKEPAR^ 

DPKPSVSRTQPREVRRAASSVPNLKKTNAQAAPKPRKAPSTSHSGFPPVANGPRSPPSLRCGEASPPSAA 
CGSPASQIiVATPSLEIjGPI PAGEEKALEIiPLAASS I PRPRTPS PESHRSPAEGSERIiSLSPLRGGEAGPD 
ASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSVSFEQVLPPSAPTSEAGLSLPLRGPRARRSASPHDVDL 
CIjVS PCEFEHRKAVPMAP APAS PGS SNDSSARSQERAGGLGAEETPPTSVSESLPTLSDSDPVPIiAPGAA 
DSDEDTEGFGVPRHDPLPDPLKVPPPLPDPSSICMVDPEMLPPKTARQTENVSRTRK^ 
ATPVAAAKTKGtiAGGDRASRPLST^RSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPV 
YIiDLAYLPSGSSAHLVDEEFFQRVRALCYVISGQDQRKEEGMRAVLDAIj 
VAMHTWYAETHARHQALGITVLGSNSMVSMQDDAFPACKVEF 

Human VCY2EP1 Protein sequence - var8 (public gi: 10434894) (seq id no: 322) 

MGVGRLDMYVLHPPSAGAERTLASVC^U^LVWHPA 

WTPQDLEGPGRAESKESVGSRDSSK31EGLLATHPRPGQERPGVARKEPARAEAPRKTEKEAKTPRELKK 
DPKPSVSRTQPREVRRAASSVPNL.KKTNAQAAPKPRKAPSTSHSGFPPVANGPRSPPSIiRCGEASPPSAA 
CGS PASQLVATPSLELiGPI PAGEEKALELPLAASS I PRPRTPS PESHRSPAEGSERLSLSPLiRGGEAGPD 
ASPTVTTPTVTTPSLPAEVGSPHSTEVDESLSVSFEQVLPPSAPTSEAGLSLPLRGPRARRSASPHDVDL 
CLVSPCE FEHRKAVPMAP APASPGSSNDSSARSQERAGGLGAEETPPTSVSESLPTLSDSDPVPLAPGAA 
DSDEDTEGFGVPRHDPLPDPLKVPPPLPDPSSICMVDPEI^PPKTARQTEKTtfS 

ATPVAAAKTKGLAGGDRASRPLSARSEPSEKGGRAPLSRKSSTPKTATRGPSGSASSRPGVSATPPKSPV 

YLDLAYLPSGSSAHLVDEELFQRVRALCWISGQDQRKEEGMRAVIjDALLASKQHWDR^ 

VAI^TWYAETHARHQALGITVLGSNSMVSMQDDAFPACKVEF 

Human VCY23P1 Protein sequence - var9 (public gi: 7022844) (seq id no: 323) 

MGVGRLDMYVLHP PS AGAERTLASVCALLVWHPAGPGEKVVRVI»FPGCTP PACLLDGIiVRLQHIjRFLREP 
WTPQDLEGPGRAESKESVGSRDSSK^GLIATHPRPGQERPGVARK^PARAEAPRKTEKEAKTPRELKK 
DPKPSVSRTQPREVRRAASSVPNIjKKTNAQAAPKPRKAPSTSHSGFPPVANGPRSPPSLRCGEASPPSAA 
CGSPASQLVATPSLELGPIPAGEEKALELPLAASSIPRPRTPSPESHRSPAEGSERLSLSPIiRGGEAGPD 
AS PTVTTPTVTTPSLPAEVGS PHSTEVDESLS VS FEQVLPPSAPTSEAGLSLPLRGPRARRSAS PHDVDL 
CIiVSPCEFEHRKAVPMAPAPASPGSSNDSSARSQERAGGLGAEETPPTSVSESLPTIiSDSDPVPLAPGAA 
DSDEDTEGFGVPRHDPLPDPLKVPPPLPDPSSICMVDPEMLPPKTARQ 

ATPVAAAKTKGLAGGDRASRPLSARS E PSE KGGRAPLSRKS STPKTATRGPSGSASSRPGVS ATP PKS PV 
YLDLAYLP SGS S AHLVDEE FFQRVRALCYVT SGQDQRKEEGMRAVTjDAIjIiAS KQHWDRDLQVTL I PT FDS 
VAMHTWYMTHARHQAIiGITVLGSNGMVSMQDDAFPACKVEF 
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Unigene Namej SPG20 Unigene ID; Hs. 118087 

Human SPG20 mRNA sequence - varl (public gi: 28436884) (SEQ ID NO: 367) 

AGTGTAAGGGAGTCK3GAGCTGGTCC 

GCCCX3AGGTCTGGAAGGCGCAGAAATGGAGCAAGAGC 

TCAGAGAAGCATATAAGAAGGCCTTTTTATTTGTTAACAAAGGTCTGA 

GGAAGAAGCAAAGAACTACTATAAGCAAGGAATAGGACACCTGCTCAGAGGGAT 

GAGTCTGAACACACAGGTACTGGGTGGGAATCTGCTAGACAGATGC 

AGAATGTACGCACCAGGCTGGAAATTCTAGAGAAGGGTCTTGCCACTTCTCTC 

GGTGCCCAAGTTATATCCAGAATTTC(3^CCTAAAGACATGTC 

AGTTCAGCTCCTCAGCATGCTGAAGTAAATGGAAACACCTCAACTCCAAGTGCAGGG 
CTGCTTCTCTGTCTTTACGATCACAAAGTTGTCCAGCAGAAGCTCCT 
TGAAGGTCACTACACTCTATCCTATGGAACAGATTC 
AGGAATCATTCTCAGCCACCGCCTCTTGAGACCTT^ 

ATGGAGTACAGATTTTTTTTGTAAATCCTGCAGGGGAGGTTAGTGCACCTTCGTATCCTGGGTACCTTCG 
AATTGTGAGGTTTTTGGATAATTCTCTCGATACGG 

GACTGGTTATATCCTCTAGTTCCTGATAGATCTCCGGTTCTGAAATGTACTGCGGGAGCCTACATGTTTC 
CTGATACAATGCTACAAGCAGCAGGATGCTTTGTGGGGGTCGTCCTGTCCTCTGAGTTA 
TAGAGAGCTCTTTGAGGATCTGTTAAGGCAAATGTCTGACCTTCGGCTCCAGGCCZAAOT 
GAAGAAGAAAATGAATTCCAAATCCCTGGAAGAACTAGAC^ 

GCACTGATGTGAAACAGTTGGACCAAGGCAATAAGGATGTACGTCATAAAGGAAAACG 
TAAAGATACTTCAAGTGAAGAAGTTAACCTGAGTCACATT 

CCAAAAGAATTACATGAATGGAGTGAAAAAGTGGCTCACAACATTTTGTCAGGTGCTTCCTGGGTGAGTT 

GGGGTTTAGTCAAAGGTGCTGAGATTACTGGTAAGGCAATCCAGAAAGGTGCTTCTAAACTCCGAGAGC^ 

GATTCAACCAGAAGAAAAACCCGTGGAAGTTAGTCCAGCTGT(^^ 

GCTACAGGAGGAGCAGCAAAAGTCAGTGAGTTCCTGGTTG^^ 

GAAAAGAACTAGCTCCACATGTCAAGAAGCATGGAAGCAAACT^^ 

AGATGGGAAATCTCCTCTGGATGGTGCTATGGTTGTAGCAGCGAGTAGTGTTCAAGGATTTTCAACTGTC 

TGGCAAGGATTGGAATGTGCAGCTAAATGCATCGTTAACAATGTTTCAGCAGAAACTGTACAAACTGTC^ 

GATACAAATACGGATATAATGCAGGAGAAGCTACCCACCATGCGGTGGATTCTGCGGTCAATGTTGGCGT 

AACTGCCTACAATATTAACAAGATTGGTATCAAAGCAATGGTGAAGAAAACTGCAACA 

ACTCTCCTTGAGGACTATCAGATAGTTGATAATTCTCAGAGGGAAAATGAAGAAGGAGCAGCAAATGT^ 

ACGTGAGAGGGGAGAAGGATGAGCAGACGAAGGAAGTAAAGGAGGCAAAGAAGAAAGATAAATGATGAAG 

TGCTGGGAATCACTTATACCAAAGCCTTATGAAATGGATGAAATTTTGTTAAATAGGCAAATGTGGAATT 

CCTCACAGATTAACCAGTATTTTTTAAATGTATTCATTCCTACAAATTAACTTTCATAAATTTTATGGCA 

TGTCTTCTATTTAAAAGGAAAAGAATAAGTATTCTTGC^TCTGGCCTTAGAAATGTGAAGTTATATTCTC 

AAGTTTATTTTTTTCCAAGTGTAGCTAAAATATTTTTGCAGGTAAAATAAAGCTGATAGTACATGTGTTG 

TTCAAACCTTGTTAAACCTAATATTGAACTATTTTTATATCTGCTGTCTTTCAGAAGGCAAATAGGAAAC 

TATATATTTGCTTAAAAATTGGCATTTAGTAACCTTAATTCTTTTTATAGAAGGAATGACTTA 

GTCCCCTCTTTTTGCACTAATTGTGGATTTTTTTAGATGCTTCTCAAAATTTTCAGTGTGTAAGOT 

AAAAACTAAAACTAAGAATTCTCAAAAAGACTTGTTCAAAACAGGGAAAGACTGATGAAAAGTAAAATGG 

ACTACTTTTGTAACTTACCTGTTTGTTAGGAAATGGAATGGTTTCTTTGATTTAAAATGAATAAAAATAG 

ATTATTACGTCTTTTGTATTGAGACTGTATTGTTATGAGCCTAGGAAATTTGGGAACATGATTGTATTGT 

ATTAAAATTCGAAGTGATTATTATCAGCTTAATTGGATTAAAAAAGTACTTCAAGAAATTAAAAAAAA^ 

AAAAAAAAAAAATAAAAAAAAAAAAAAA 

Human SPG20 mRNA sequence - var2 (public gi: 7023530) (SEQ ID NO: 368) 

AGGGAGCTCTCGAGGCAACGCCGGGGCGCCCGAGGTCTGGAAGGCGCAGAAATGGAGCAAGAGCCACA^ 
ATGGAGAACCTGCTGAAATTAAGATCATCAGAGAAGCAT^ 

TCTGAATACAGATGAATTAGGTCAGAAGGAAGAAGCAAAGAACTACTATAAGCAAGGAATAGGACACCTG 
CTC71GAGGGATCAGCATTTCATCAAAAGAGTCTG 
TGCAACAOAAAATGAAAGAAACTCTA<^GAATGT^ 
CACTTCTCTGCAGAATGATCTTCAGGAGGTGGCCAAGT^ 
• GAAAAATTACC^GAGCCTCAGTCTTTTAGTTCAGCT 
CTCCAAGTGCAGGGGCAGTTGCTGCACCTGCTTCTCTC 

TCCTCCTGCTTATACTCCTCAAGCTGCTGAAGGTCACTACACTGTATCCTATGGAACAGATTCTGGGGAG 
TTTTCATCAGTTGGAGAGGAGTTTTATAGGAATGATTCT 

TGCACCTTCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTOTCTCGATACGGTTCTAAAC 
CGTCCTCCCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTCTAGTTCCTGATAGATCTCCGGTTCTGA 
AATGTACTGCGGGAGCCTACATGTTTCCTGATACAATGCTACAAGCAGCAGGATGCTTTGTGGGGGTCG 
CCTGTCCTCTGAGTTACCAGAGGATGATAGAGAGCTCTTTGAGGATCTGTTAAGGCAAATGTCTGACCTT 
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CGGCTCCAGGCCAACTGGAACAG 

CCTCTGACCAACTAAAAGAAGCCTCTGGCACTGATGT^^ 

TCATAAAGGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAGTGAAGA^ 

CCATGTGAGCCAGTTCCAGAAGAAAAGCCAAAAGAATTACCTGAATGGA 

TTTTGTCAGGTATTA(^GTAATGTTAATTTTTTTC 

ATATTGTTTATTAGGGAATACTGAGAAAGATAATATTTTGTATTT^ 

TAAATGTAGAAGGAACTAGTCTTTTGAAACAT^ 

TTATTCTGTTTTACTATTGGGAAAATGGATAGTGAAAA CAAATT CTTAATAGTTCT 

GAATCTAAAATTAGTTATGTTCGTTTCCCCTTTGAAGCTCCCTCTTAACCTCCCCCTACCCCTGTC 

AGCTGTGGTCTGAATGTGTCCCTTCAAAATTCATATATTGAAATCCTAACCCCTGAGGT 

GAGGTGGGGCCTTTGGAAGGTGATTAGGTCATGAGGGAGGAGCCCTCATCAATGGGATTAGT 

AAAGAGATCCCAAAGAGCTGCCTTGTCCCTT^ 

AACCAGGAAGTGGGCCCCTCACGAGAGACCAAATGTAC 

GAATTGTGAGAAATAAATTTTTGTTGTTAAT 

Human SPG20 mRNA sequence - var3 (public gi: 7023938) (SEQ ID NO: 369) 

GATAATTCTCTCGATACGGTTCrTAAAC(^TCCTCCCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTC 
TAGTTCCTGATAGATCTCCGGTTCTGAAATGTACTGCGGC^ 

AGCAGCAGGATGCTTTGTGGGGGTCGTCCTGTCCrCTGAGTTACCAGAGGATGATA 

GATCTGTTAAGGC^VAATGTCTGACCTTCGGCTCCAGGCCAACT 

TCCAAATCCCTGGAAGAACTAGACCCTCCTCTGACCAACTAAAAGAAGCCT 

GTTGGACCAAGGCAATAAGGATGTACGTCATAAAGGAAAACGTGGAAAAAGGGOT 

GAAGAAGTTAACCTGAGTCACATTGTACCATGTGAGCCAGTTCCAGAAGAAAA^ 

AACGGAGTGAAAAAGTGGCTC^CAACATTTTGTCAGGTGCTTCCTGGGTGAGTTGGGG 

TGCTGAGATTACTGGTAAGGCAATCCAGAAAGGTGCTTCTA 

AAACCCGTGGAAGTTAGTCCAGCTGTCACCAAGGGACTTT^^ 

CAAAAGTGAGTGAGTTCCTGGTTGATGGAGTTTGCACTC 

ACATGTCAAGAAG CATGGAAGCAAACTTGTT CGAGAATCT CTTAAAAAAGACAAAGATGGGAAATCTCCT 

CTGGATGGTGCTATGGTTGTAGGAGCAAGTAGTGTTCAAGGATTTTC^^ 

GTGCAGCTAAATGCATCGTTAACAATGTTTCAGC^ 

TAATGCAGGAGAAGCTACCCACCT^TGCGGTGGATTCTGCGGTCAATGTTGGCGTAACTGCCTACAATATT 
AACAACATTGGTATCAAAGCAATGGTGAAGAAAACTGCAA 

ATCAGATAGTTGATAATTCTCAGAGGGAAAATC^AGAAGGAGC^GC^AATGTCAACGTGAGAGGGGAG^ 
GGGTGAGCAGACGAAGGAAGTAAAGGAGGCAAAGAAGAAAGATAAATGATGAAGTGCTGGGAATCACTTA 
TACCAAAGCCTTATGAAATGGATGAAATTTTGTTAAATAG 

GTATTTTTTAAATGTATTCATTCCTACAAATTAACTTTGATAAATTTTATGGCA 

GGAAAAGAATAAGTATTCTTGCATCTGGCCTTAGAAATGTGAAGTTATATTCTCAAGTTTATTTTTTT 

AAGTGTAGCTAAAATATTTTTGCAGGTAAAATAAAGCTGATAGTACATGTGTTGTTCAAACCTT 

CCTAATATTGAACTATTTTTATATCTGCTGTCTTTCAGAAGGCAAATAGGAAACTATATATTTGCTTAAA 

AATTGGCATTTAGTAACCTTAATTCTTTTTATAGAAGGAATGACTTAAAGTATTGTCCCCTCTTTTTGCA 

CTAATTGTGGATTTTTTTAGATGCTTCTCAAAATTTTCAGTGTGTAAGCTAAACAAAAACTAAAACTAAG 

AATTCTCAAAAAAACTTGTTCAAAACAGGGAAAGACTGATGAAAAGTAAAATGGACTACTTTTGTAACTT 

ACCTGTTTGTTAGGAAATGGAATGGTCTCTTTGATTTAAAATGAATAAAAATAGATTATTACGTC 

Human SPG20 nxRNA sequence - var4 (public gi: 16553694) (SEQ ID NO: 370) 

GTGCATGTTTTCTTCAGTCCTGGAAGGAAATCATAAGT 

GAGCAAGAGCCACAAAATGGAGAACCTGCTGAAATTAAGATCATCAGAGAAGCATATAAGAAGGCCTTT.T 

TATTTGTTAAGAAAGGTCTGAATACAGATGAATTAGGTCAGAAGGAAGAAGCAAAGAACTACT 

AGGAATAGGACACCTGCTCAGAGGGATCAGCATTTCATCAAAAGAGTCTGAACACACAGGTCCTGGGTGG 

GAATCTGCTAGACAGATGCAACAGAAAATGAAAGAAACTCTACAGAATGATCTTCGTATCCTGGGTACCT 

TCGAATTGTGAGGTTTTTGGATAATTCTCTCGATACGGTTCTAAACCGTCCTCCCGGGTTTCTTCAGGTT 

TGTGACHtSGTTATATCCTCTAGTTCCTGATAGATCTCCGGTTCTGAAATGTACTGCGGGAGCCTACATGT 

TTCCTGATACAATGCTACAAGCAGCAGGATGCTTTGTGGGGGTCGTCCTGTCCTCTGAGTTACCAG 

TGATAGAGAGCTCTTTGAGGATCTGTTAAGGCAAATGTCTGACCTTCGGCTCCAGGCCAACTGGAACAGA 

GCAGAAGAAGAAAATGAATTCCAAATCCCTGGAAGAACTAG 

CTGGCACTGATGTGAAACAGTTGGACCAAGGCAATAAGGATGTACGTCATAAAGGAAAACGTGGAAAAAG 

GGCTAAAGATACTTCAAGTGAAGAAGTTAACCTGAGTCAC^^ 

AAGCCAAAAGAATTACCTGAATGK3AGTGAAAAAGTGGCTCACAACA 

GTTGGGGTTTAGTCAAAGGTGCTGAGATTACTGGTAAGGCAATCCAGAAAGGTGCTTCTAAACTCCGAGA 
GCGGATTCAACCAGAAGAAAAACCCGTGGAAGTTAGTCCAGCTGTCACCAAGGGACTTTATATAGCGAAG 
CAAGCTACAGGAGGAGCAGCAAAAGTCAGTCAGTTCCTGGTTGATGGAGTTTGCACT 
TTGGAAAAGAACTAGCTCCACATGTCAAGAAGCATGGAAGCAAACTTGTTCCAGAATC 
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CAAAGATGGGAAATCTCCTCTGGATC 

GTCTGGCAAGGATTGGAATGTGC^GCTAAATGCATC^TTAACAATGTT 

TCAGATACAAATACGGATAATGCAGGAGAAGCTACCG^^ 

TAACTGCCTACAATATTGACAACATTGGTATC^^ 

C^CTCTCCTTGAGGACTATCAGATAGTTGATAATTC 

AACGTGAGAGGGGAGAAGGATGAGCAGACGAAGGAAGTAAAGGAC^ 

GTGCTGGGAATCACTTATACCAAAGCCTTATGAAATGGATGAAAra 

TCCTCACAGATTAACCA.GTATTTTTTAAATGTATTCATTCCTACA^ 

ATGTCTTCTATTTAAAAGGAAAAGAATAAGTATTCTTGCATCTGGCOT 

CAAGTTT ATTTTTTT CCAAGTGT AG CTAAAAT ATTTTTGCAGGTAAAAT AAAGCTGAT AGTACATGTGTT 

GTTCAAACCTTGTTAAACCTAATATTGAACTATTTTTATATCTGCTGTCTTT 

CTATATATTTGCTTAAAAATTGGCATTTAGT^ 

TGTCCCCTCTTTTTGCACTAATTGTGGATTTTT^ 

CAAAAA.CTAAAACTAAGAATTCTCAAAAAAACTTGTT 

GACTACTTTTGTAACTTACCTGTTTGTTAG 

GATTATTACGTC 

Human SPG20 mRjSfA sequence - var5 (pubHc gi: 21654722) (SEQ ID NO: 371) 

ATGGAGCAAGAGCCACAAAATGGAGAACCTGCTGAAATTAAGA 
TTTTATTTGTTAACAAAGGTCTGLAATAC^ 

GCAAGGAATAGGACACCTGCTCAGAGGGATCAGCATTTCATCAAAAGAGTCT 

TGGGAATCTGCTAGACAGATGCAACAGAAAATGAAAGAAACTCTACA^ 

TTCTAGAGAAGGGTCTTGCCACTTCTCTGCAGAATGATCTT 

TCCACCTAAAGACATGTGTGAAAAATTACCAGAGCCTGAGTCT^ 

GTAAATGGAAACACCTCAACTCCAAGTGCAGGGGCAGTTGCT 

AAAGTTGTCCAGCAGAAGCTCCTCCTGCTTATACTCCTC^GCTGCTGAAGGTCACTACACTGTATCCTA 

TGGAACAGATTCTGGGGAGTTTTCATCAGTTGGAGAGGAGTTTTATAGGAATCATTCTCAG 

CTTGAGACCTTAGGGCTGGATGCAGATGAATTGATTTTGAT^^ 

ATCCTGCAGGGGAGGTTAGTGCACCTTCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTC 
TCTCGATACGGTTCTAAACCGTCCTCCCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTCTAGTTCCT 
GATAGATCT.CCGGTTCTGAAATGTACTGCGGGAGCCTACATGTTTCCTGATACAATGCTACAAGCAGCAG 
GATGCTTTGTGGGGGTCGTCCTGTCCTCTGAGTTACC^ 

AAGGCAAATGTCTGACCTTCGGCTCCAGGCCAACTGGAACAGAGCAGAAGAAGAAAA 
CCTGGAAGAACTAGACCCTCCTCTGACCAACTAAAAGAAGCCTCTGGCACTGATGTGAAACAGTTGGACC 
AAGGCAATAAGGATGTACGTCATAAAGGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAGTGAAGAAGT 
TAACCTGAGT CACATTGTACCATGTGAGCCAGTT CCA.GAAGAAAAGCCAAAA.GAATTACCTGA 
GAAAAAGTGGCTCACAACATTTTGTCAGGTGCTTCCTGGGTGAGTTGGGGTTTA 

TTACTGGTAAGGCAATCCAGAAAGGTGCTTCTAAACTCCGAGAGCGGATTCAACCAGAAGAAAAACCCGT 
GGAAGTTAGTCCAGCTGTCACCAAGGGACTTTATATAGCGAAGCAAGCTACAGGAGGAGCAGCA7VAAGTC 
AGTCAGTTCCTGGTTGATGGAGTTTGGACTGTAGC^AATTC 

AGAAGCATGGAAGO\AACTTGTTCCAGAATCTCTTAAAAAAGACAAAGATGGGAAATCTCCTCTGGATGG 

TGCTATGGTTGTAGCAGCAAGTAGTGTTCAAGGATTTTCAACTGTCTGGCAAGGATTGGAATGTGC^ 

AAATGCATOSTTAACAATGTTTCAGC^GAAACTGTACAAACTGTCAGATACAAATACGGATATA^ 

GAGAAGCTACCCACCATGCGGTGGATTCTGCGGTCAATGTTGGCGTAACTGCCTACAATATTAACAAC^ 

TGGTATCAAAGCAATGGTGAAGAAAACTG CAACACAAACAGGACACACTCTC CTTGAGGACTATCAGATA 

GTTGATAATTCTCAGAGGGAAAATCAAGAAGGAGCAGCAAATGTCAACGTGAGAGGGGAGAAGGATGAGC 

AGACGAAGGAAGTAAAGGAGGCAAAGAAGAAAGATAAATGA 

Human SPG20 mRNA sequence - var6 (public gi: 22074831) (SEQ ID NO: 372) 

GCGGCCGCGC^GGGAGCTCTCGAGGCAACGCCGGGGCGC^ 
GAGCCACAAAATGGAGAACCTGCTGAAATTAAGATC^TCAG 

TTAACAAAGGTCTGAATACAGATGAATTAGGTCAGAAGGAAGAAGCAAA.GAACTACTATAA 
AGGACACCTGCTCAGAGGGATC^GCATTTCATC^^ 

GCTAGACAGATGCAACAGAAAATGAAAGAAACTCTACAGAATGTACGCACCAGGCTGGAAATTCTAGAGA 

AGGGTCTTGCCACTTCTCTGCAGAATGATCTTCAGGAGGTGCCCAAGTTATATCCAGAATTTCCACCTA 

AGACATGTGTGAAAAATTACC^GAGCCTCAGTCTTTTAGTTCAGCTCCTCAGCATGCTGAAGTAAATGGA 

AACACCTCAACTCCZUVGTGCAGGGGCAGTTGCT^ 

C^GCAGAAGCTCCTCCTGCTTATACTCCTCAAGCT 

TTCTGGGGAGTTTTCATCAGTTGGAGAGGAGTTTTATAGGAATC^ 

TTAGGGCTGGATGCAGATGAATTGATTTTGATACCAAATGGAGTACAGATTTTTTT 

GGGAGGTTAGTGCACCTTCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTCTCTCGATAC 
GGTTCTAAACCGTCCTCCCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTCTAGTTCCTGATAGATCT 
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CCGGTTCTGAAATGTACTGCGGGAGCCTA^^ 
TGGGGGTCGTCCTGTCCTCTGAGTTACCAGAGGATGAT^^ 

GTCTGACCTTCGGCTCCAGGCCAACTGGAACAGAGCAGAAGAAGAAAATGAAT^ 
ACTAGACCCTCCTCTGACCAACTAAAAGAAGCCTCTG^ 

AGGATGTACGTCATAAAGGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAG 

TCACATTGTACCATGTGAGCCAGTTCCAGAAGAAAAGCCAAAAGAATTACCTG 

GCTC^CAACATTTTGTCAGGTGCTTCCTGGGTGAGCT 

AGGCAATCCAGAAAGGTGCTTCTAAACTCCGAGAGCGGATTCA 

TCCAGCTGTCACCAAGGGACTTTATATAGCXxAAGC^ 

CTGGTTGATGGAGTTTGCACTGTAGCAAATTGCGTTGGAAAAGAACTA 

GAAGCA^CTTGTTCCAGAATCTCTTAAAAA 

TGTAGCAGCAAGTAGTGTTCAAGGATTTTCAACTGTCT 

GTTAACAATGTTTCAGCAGAAACTGTACAAACTGT 

CCCA.CCATGCGGTGGATTCTGCGGTCAATGTTGGCGTAACTGCCTACAATA 

AGCAATGGTGAAGAAAACTOCAACACAAACAGGACACACT 

TCTCAGAGGGAAAATCAAGAAGGAGCAGCAAATGTCAAOT 

AAGTAAAGGAGG CAAAG AAG AAAGATAAATGATGAAGTGCTGGGAATCACTTAT ACCAAAGC CTT ATGAA 

ATGGATGAAATTTTCTTAAATAGGCAAATGTGGAAOT 

TCATTCCTACAAATTAACTTTCATAAATTTT^^ 

CTTGCATCTX^CCTTAGAAATGTGAAGTTATATTCTCAAGTTTATTTTTTTCCAAGT^ 
TTTTGCAGGTAAAATAAAGCTGATAGTACATGT^ 

TTTATATCTGCTGTCTTTCAGAAGGCAAATAGGAAACTATATATTTGCTTAAAAAOT 

CTTAATTCTTTTTATAGAAGGAATGACTTAAAGTATTGTCCCCTCTTTTTGCACTAATTGTGGATTTTTT 

TAGATGCTTCTCAAAATTTTC^GTGTGTAAGCTAAACAAAAACTAAAACTAAGAATTCTCAAAAAAA 

GTTCAAAACAGGGAAAGACTGATGAAAAGTAAAATGGACTACTTTTGTAACTTACCTGTTT 

TGGAATGGTCTCTTTGATTTAAAATAAAT^^ 

TATGAGCCTAGGAAATTTGGGAACATGATTGTATTGTATTAAAATTCGAAGTGATTATTATCAGCTTAAT 
TGGATTAAAAAAGTACTTCAAGAAATTATTTTATCATATCTGCTTCTGTTTTTCCAAAAGGTTAAAACTT 
GTAAAAAAAATATATATAAACAATTGAGTTTACTAATGGTAAACATTTTTATTCTGGGATTCGGTCATTG 
GAATTTATATTAAAAGACAAGTTATTAAAAAGGAAAGGTTCTATTCATAATCAGGGTAAAGAATATGAAA 
ACCTTAGACGTAATCCATGGTGGATAGGCATTATGGTTTCCACTT^ 

CCTATTTACTTACATAGGCTAAAAAAOTATGTAACTAAATACCTAATGGTATTTAATTTTTGTTTACT 

ATTTAAGAGATTGGTATTAGTTTTCATAGCTGTAGTCCATTCTAATAATTTCTGATCTTCTAGTGGCTAC 

TTAATTAGACATTATTTGAAGCTGTCTGAAGAATGCACTTTATGAATTAAAA?^ACTGAATTGCCTGACCT 

CGTTATCACATGAGCTTATATTTTGGGAACACATAGAACTGATGGAGGCTTTTCCTAAGGCCAAGGATAA 

TGTACTAGTTGTTAAAATGGAAATAAAAGTGAAGTGGTAAAT 

Human SPG20 mRNA sequence - var7 (public gi:20070809) (SEQ ID NO: 373) 

GGCGGCGGCGTGCTGCGGGCTCTGTGGCGGGAGCGAGGCCGACGGGCGGGGCCGTGCGGCCGCGTGACGC 

GAAGCGTTCGAGAGCGCGCGTCGTGGAACGTCTTGGTTGCCACGGCAAGCGCGCGCGCGAGGCCTTGGGA 

ACCTCGGGACCGGCCCCCGGCGAGCGCAGCGGCGCCCAGTAGTCATCTTAGTGGGATTTGGGGAAGCAAC 

AGGGCTGTGTGGGGTAACCTGCCACCTTTAAGTGGAAATCAGAAATGGAGCAAGAGCCACAAAATGGAG^ 

ACCTGCTGAAATTAAGATCATCAGAGAAGCATATAAGAAGGCCTTTTTATTTGTTAACAAAGGTCTGAAT 

ACAGATGAATTAGGTCAGAAGGAAGAAGCAAAGAACTACTATAAGCAAGGAATAGGACACCTGCTCAGAG 

GGATCAGCATTTCATCAAAAGAGTCTGAACA(^GAGGTCCTGGGTGGGAATCTGCTAGACAGATGC^C^ 

GAAAATGAAAGAAACTCTACAGAATGTACGCAC CAGGCTGG AAATTCT AGAGAAGGGTCTTG CCACTTCT 

CTGCAGAATGATCITCAGGAGGTGCCCIAAGTTATATCCAGAATTTCCACCTAAAGACA 

TACCAGAGCCTCAGTCTTTTAGTTCAGCTCCTCAGC^TGCT 

TGCAGGGGCAGTTGCTGCACCTGCTTCTCTGTCTTTACO 

GCTTATACTCCTCAAGCTGCTGAAGGTCACTACACTGTATCCTATGGAACAGATTCTGGGGAGTTTTCAT 

CAGTTGGAGAGGAGTTTTATAGGAATC^TTCTC^GCCACCGCCTCTTGAGACCTTAGGGCT 

TGAATTGATTTTGATACCAAATGGAGTACAGATTTTTTTTGTAAATCCTGCAGGGGAGGTTAGTGCACCT 

TCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTCTCTCGATACGGTTCTAAACCGTCCTC 

CCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTCTAGTTCCTGATAGATCTCCGGTTCTGAAATGTAC 

TGCGGGAGCCTACATGTTTCCTGATACAATGCTACAAGCAGCAGGATGCTTTGTGGGGGTCGTCCTGTCC 

TCTGAGTTACCAGAGGATGATAGAGAGCTCTTTGAGGATCT^ 

AGGCCAACTGGAACAGAGCAGAAGAAGAAAATGAATTCCAAATCCCTGGAAGAACTAGACCCTCCTCTGA 
CCAACTAAAAGAAGCCTCTGGCACTGATGTGAAACAGTTGGACCAAGGCAATAAGGATGTACGTCATAAA 
GGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAGTGAAGAAGTTAACCTGAGTCACATTGTACCATGTG 

agccagttccagaagaaaagccaaaagaattacctgamggag 
aggtgcttcctgggtgagttggggtttagtcaaaggtgctgagat^^ 

gcttctaaactccgagagcggattcaaccagaagaaaaacccgtggaagttagtcc7vgctgtcaccaagg 
gactttatatagcgaagcaagctagaggaggagc^gcaa^^ 
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CACTGTAGCAAATTGCGTTGGAAAAGAACT 

AGAATCTCTTAAAAAAGACAAAGATGGGAAATCTCCTCT<^^ 

GTTCAAGGATTTTCAACTGTCTGGCAAGGATTGGAATGTG 

CAGAAACTGTACAAACTGTCAGATACAAATACGGATAT^ 

TTCTGCGGTCAATGTTGGCGTAACTGCCTACAATATTT^ 

ACTGCAACACAAACAGGACACACTCTCCTTGAGGACTATCAGATA 

AAGAAGGAGCAGCAAATGTCAACGTGAGAGGGGAGAAGGATGAGCA^ 

GAAGAAAGATAAATGATGAAGTGCTGGGAATCACTTATACCAAAGCCTT 

TAAATAGGCAAATGTGGAATTCCTCACAGATTAACCAGTATTTTTTAAATGTA 

ACTTTCATAAATTTTATGGCATGTCTTCTATTTAAAAGGAAAAGAATAAGTAT^ 

GAAATGTGAAGTTATATTCTCAAGTTTATTT^ 

AAGCTGATAGTACATGTGTTGTTCAAACCTTGTTAAACCTAATATTGAACTATTTTTATATCTGCTGTCT 

TT(^GAAGGCAAATAGGAAACTATATATTTGCTTAAAAATTGGCATTTAGTAACCTTAATT 

GAAGGAATGACTTAAAGTATTGTCCCCTCTTTTTGCACTAAT^ 

TTTTCAGTGTGTAAGCTAAACAAAAACTAAAACTAAGAATTCTCAAAAAAACTTO 

GACTGATGAAAAGTAAAATGGACTACTTTTGTAACTTACCTGTTTGOT 

ATTTAAAATGAATAAAAATAGATTATTACGTCTTTTGTATTGAGACTGTATTGTTATGAGCCTAGGAAAT 

TTGGGAAC^TGATTGTATTGTATTAAAATTCGAAGTGATTATTATCAGCTTAATTGGA 

TTCAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human SPG20 mRNA sequence - var8 (public gi: 3043743) (SEQ ID NO: 374) 

GCGGCCGCGCAGGGAGCTCTCGAGGCAACGCCGGGGCGCCCGAGGTCTGGAAGGCGCAGAAATGGAGCAA 

GAGCCACAAAATGGAGAACCTGCTGAAATTAAGATC^TCAGAGAAGCATATAAGAAGGCCT 

TTAACAAAGGTCTGAATACAGATGAATTAGGTCAGAAGGAAGAAG 

AGGACTVCCTGCTCAGAGGGATCAGCATTTCATCAAAAGAGTCTGAACAC^ 

GOTAGAGAGATGCAACAGAAAATGAAAGAAACTCTA 

AGGGTCTTGCCACTTCTCTGGAGAATGATCTTCAG 

AGACATGTGTGAAAAATTACCAGAGCCTCAGTCTTTTAGTTCAGCTCCTCAGCATGCTGAAG 
AAOkCCTCAACTCCAAGTGCAGGGGCAGTTGCTGC^ 

CAGCAGAAGCTCCTCCTGCTTATACTCCTCAAGCTGCTGAAGGTC^CrACACTGTATCCTATGGAACAGA 

TTCTGGGGAGTTTTCATCAGTTGGAGAGGAGTTTTATAGGAATCATTCTCAGCCACCGCCTCTTGAGACC 

TTAGGGCTGGATGCAGATGAATTGATTTTGATACCAAATGGAGTACAGATTTTTTTTGTAAATC 

GGGAGGTTAGTGCACCTTCGTATCCTGGGTACCTTCGAATTGTGAGGTTTTTGGATAATTCTCTCGATAC 

GGTTCTAAACCGTCCTCCCGGGTTTCTTCAGGTTTGTGACTGGTTATATCCTCTAGTTCCTGATAGATCT 

CCGGTTCTGAAATGTACTGCGGGAGCCTACATGTTTCCTGA^ 

TGGGGGTCGTCCTGTCCTCTGAGTTACCAGAGGATGATAGAGAGCTCTTTGAGGATCTGTTAAGGCAAAT 

GTCTGACCTTCGGCTCGAGGCCAACTGGAACAGAGCAGAAGAAGAAAATGAATTCCAAATCCCTGGAAGA 

ACTAGACCCTCCTCTGACCAACTAAAAGAAGCCTCTGGCACTGATGTGAAACAGTTGGACCAAGGCAATA 

AGGATGTACGTCATAAAGGAAAACGTGGAAAAAGGGCTAAAGATACTTCAAGTGAAGAAGTTAACCTGAG 

TCACATTGTACCATGTGAGCCAGTTCCAGAAGAAAAGCCAAAAGAATTACCTGAATGGAGTGAAAAA 

GCTCAC^CATTTTGTCAGGTGCTTCCTGGGTGAGTTGGGGTTTAGTCAAAGGTGCTGAGATTACTGGTA 

AGGCAATCCA.GAAAGGTGCTTCTAAACTCCGAGAGCGGATTCAACCAGAAGAAAAACCCGTGGAAGTTAG 

TCCAGCTGTCACCAAGGGACTTTATATAGCGAAGCAAGCTACAGGAGGAGCAGCAAAAGTCAGTCAGTTC 

ctggttgatggagtttgcactgtagcaaattgcgttggj^ 

gaagcaaacttgttccagaatctcttaaaaaaga(^aagatgggaaatctcctctggatggtgctatggt 

tgtagcagcaagtagtgttcaaggattttcaactgtctggcaaggattggaatgtgcagct 

gttaacaatgtttcagcagaaactgtacaaactgtcagatacaaatacggatataatgcaggagaag 

cccaccatgcggtggattctgcggtcaatgttggcgt^ 

!agcaatggtgaagaaaactgcaacacaaacaggacacactct 

tctcagagggaaaatcaagaaggagcagcaaatgtgaacgtgagaggggagaaggatgagcagacga 
aagtaaaggaggcaaagaagaaagataaatgatgaagtgctgggaatcacttataccaaagccttatgaa 

ATGGATGAAATTTTGTTAAATAGGCAAATGTGGAATTCCTCACAGATTAACCAGTATTTTTTAAATGTAT 

TCATTCCTACAAATTAACTTTCATAAAOTTTATGGCATGTCTTCTATTTAAAAGGAAAAGAATAAGTATT 

CTTGCATCTGGCCTTAGAAATGTGAAGTTATATTCTC^GTTTATTTTTTTCCAAGTGTAGCTAAAATAT 

TTTTGCAGGTAAAATAAAGCTGATAGTACATGTGTTGTTCAAACCTTGTTAAACCTAATATTGAACTATT 

TTTATATCTGCTGTCTTTCAGAAGGCAAATAGGAAACTATATATTTGCTTAAAAATTGGCATTTAGTAAC 

CTTAATTCTTTTTATAGAAGGAATGACTTAAAGTATTGTCCCCTCTTTTTGCACTAATTGTGGATTTTTT 

TAGATGCTTCTCAAAATTTTCAGTGTGTAAGCTAAACAAAAACTAAAACTAAGAATTCTCAAAAAAAC 

GTTCAAAACAGGGAAAGACTGATGAAAAGTAAAATGGACTACT^ 

TGGAATGGTCTCTTTGATTTAAAATGAATAAAAATAGATTATTACGTCTTTTGTATTGAGAOT 
TATGAGCCTAGGAAATTTGGGAACATGATTGTATTGTATTAAAATTCGAAGTGATTATTATCAGCTTAAT 
TGGATTAAAAAAGTACTTCAAGAAATTATTTTATCATATCTGCITCTGTTTTTCCAAAAGGTTAAAACTT 
GTAAAAAAAATATATATAAACAATTGAGTTTACTAATGGTAAACATTTTTATTCTGGGATTCGGTCATTG 
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GAATTTATATTAAAAGAG^GTTATTAAAAAGGAAAG 

ACCTTAGACGTAATCCATGGTGGATAGGCAT^ 

CCTATTTACTTACATAGGCTAAAAAACTATGTAA 

ATTTAAGAGATTGGTA!TTAGTTTT(^TAGCTGTAGTCCATTCTAATAATT^ 
TTAATTAGACATTATTTGAAGCTGTCTGAAGAATGCACTTTATGAATTAAAAAACTG^ 
CGTTATCACATGAGCTTATATTTTGGGAAGACATAGAAC^^ 
TGTACTAGTTGTTAAAATGGAAATAAAAGTGAAGTGGTAAAT 

Human SPG20 protein sequence - varl (public gi: 28436885) (SEQ ID NO: 386) 

MEQEPQNGEPAEIKZIREAYKKAFLFVNKGLN^ 
WESARQMQQKMKETLQNVRTRIiEIIjEKGIjAT^ 

VNGNTSTPSAGAVAAPASLSLPSQSCPAEAPPAYTPQAAEGHYTVSYGTDSGEFSSVGEEFYR^ 
LETIiGIjDADEIjIIjI PNGVQI FFVNPAGEVSAPSYPGYLRI VRFLDNSLDTVLNRPPGFMVCDWLYPLVP 
DRSPVLKCTAGAYMFPDTMLQAA(k:FVGVVLSSEL^ 
PGRTRPSSDQLKEASGTDVXQLDQGNKDVRHKGKR 

EKVAHNI LSGAS WVS WGLVKGAE I TGKAI QKGAS KLRERI QPEE KPVEVS PAVTKGL YI AKQATGGAAKV 
SQFLVDGVCTVANCVGKEkAPHVKKHGSKLTO^ 

KCIVNNVSAETVQTVRYKYGYNAGEATHHAVDSAVNVGVTAYNINNI GI KAMVKKTATQTGHTLiLEDYQ I 
VDNSQRENQEGAA1TVNVRGEKDEQTKEVKEAKKKDK 

Human SPG20 protein sequence - var2 (public gi: 22074832) (SEQ ID NO: 387) 

MEQEPQNGE PAE I KI IREAYKKAFLFVNKGLNTDELfGQKEEAKNYYKQGIGHIjIjRGI S I SSKESEHTGPG 
WESARQMQQKMKETLQNVRTRLEIL^ 

WGNTSTPSAGAVAAPASLSLPSQSCPAE^PAYTPQAAEGHYWSYGTDSGEFSSVGEEFYRNHSQPPP 
LETLGLDADELI LI PNGVQI FFVNPAGEVS APS YPGYLRIVRFLDNSLDTVLNRPPGFLQVCDWLYPIiVP 
DRS PVLKCTAGAYMFPDTMIiQAAGCFVGVVLS SELPEDDRELFEDLLRQMSDLRIjQANWNRAEEENE FQI 
PGRTRPSSDQIjKS^GTDVKQLDQGNKDVRHKGKRGKRAKDTSSEEVNLSHIV^ 

ekvahnilsgaswvswglvkgaeitgkaiqkgasklreriqpeekpvevspawkglyiakqatggaa 

sqflvdgvctvancvgkeijvphvkkhgsklvpeslkkdkdgkspldga 

kcivnnvsaetvqtvrykygynageathhavd^ 

VDNSQRENQEGAANVNVRGEKDEQTKEVKEAKKKDK 

Human SPG20 protein sequence - var3 (public gi: 3043744) (SEQ ID NO: 388) 

RPRRELSRQRRGARGLEGAEMEQEPQNGEPAEIKIIREAYKKAFIjFW^ 
GHIiIiRGISISSKESEHTGPGWESARQMQQKMKETLQ^ 

DMCEKLPEPQSFSSAPQHAEVNGNTSTPSAGAVAAPASLSLPSQSCPAEAPPAYTPQAAEGHYTVSYGTD 
SGEFSSVGEEFYRNHSQPPPLETLGLDADEIjII*I PNGVQI FFVNPAGEVSAPSYPGYLRIVRFIjDNSIiDT 
VLNRPPGFIiQVCDWIiYPLVPDRS PVLKCTAGAYl^PDTMbQAAGCFVGVVIiS SEIiPEDDRELFEDDDRQM 
SDLRLQANWNRAE EENEFQ I PGRTRPS SI^LKEASGTDVKQLDQGNKDVRHKGKRGKRAKDTSS EEVNLS 
HIVPCEPVPEEKPKELPEWSEKVAHNILSGASWVSWGLVKGAEITGKAIQKGASKLRERIQPEEKPVEVS 
PAVTKGIiYIAKQATGGAAKVSQFLVDGVCTVANCT^ 

VAASS VQGFSTWQGLECAAKCI VNNVSAETVQTVRYKY K 
AMVXKTATQTGHTLIjEDYQIVDNSQRENQEGAANVNVRGEKDEQTKEVKEAKKKn 

Unigene Name; WASF1 Unigene ID: Hs. 75850 

Human WASF1 mRNA sequence - varl (public gi: 4507912) (SEQ ID NO: 375) 

CTTCTCTTGCACTTGCGGATGATGAACTGGAATAACGATGAAAGAAAGCACATCCGATCTCAACATTCAC 

GTCCTGCCCTATAACCGATTAATTAATTGATCCCCAGCTAGACTAGTGTTGGAGAAATCAGCATGTTAAA 

ACAACTGTTGATGATAGCTGTTGGAGTAAAGTTGC^GTGGAAGCTATGGCTGCAAAATCGTTAAAAT 

CAAGGTGAACTGGCAGAAAGGTTAATCTCAAGATGCCGCTAGTC 

GTGCCAGACAGCACTGCCTAGAGGGATTAAGAATGAACT 

ATAATTAGACAACTAAGTAGCCTAAGTAAATATGCTGAAGATATATTTGGAGAATTATTCAATGAAGCAC 

ATAGTTTTTCCTTCAGAGTCAACTCATTGCAAGAACGTGTGGACCGTTTATCTGTTAGTGTTACACAGCT 

TGATCCAAAGGAAGAAGAATTGTCTTTGCAAGAT^^ 

G^GACCAGCAGCTTTTCGATCGCAAGACTTTGCCT^ 

AGCCTCCACCTCTCAATATACTC^CrC^ 

TCCTTCGTATTTCTTTGATCTATGGAAAGAAAAAATGTTGCAAGATACAGAGGATAAGAGGAAGGAAAAG 
AGGAAGCAGAAGCAGAAAAATCTAGATCGTCCTCATGAACCAGAAAAAGTGCCAAGAGC^CCTCATGAC^ 
GGCGGCGAGAATGGCAGAAGCTGGCCCAAGGTCCAGAGCTGGCTGAAGATGATGCTAATCTCTTACATAA 
GCATATTGAAGTTGCTAATGGCCC^GCCTCTCA^ 
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GATGGATCTTACTCACTTTCTGCCTTC 

GGGTATTAGTCAGACCACATGAACCACCTCCACCTCCACCAATGCATGGAGCAGGAGATO 

ACCCACCTGTATCAGTTCTGCTACAGGTTTGATAGAAAATCGCCCT 

CCTGTGTTTGTGAGCCCCACTCCCC^CCTCCTC^^ 

TAAGAGCT^CAATGACTTCAACTCCTCCCCCTCCA^^ 

GCAAGCTCCAGCAGTACCACCACCTCCAGCTCCrCCT 

CCTCCAATTGCACCTCCTCTAGTACAGCCCTCTCCACCAGTAGCTAGAGC 

TACCAGTTCATCCACTCCCACAAGGTGAAGTTCAGGGGCTG 

TCCACCTGGCATTCGACCATCATCACCTGTC^ 

CCAACTCCATCTACTGCCCCAGGTCCCCATGTTCGAT^ 

CTGCTTCTGAGCGAAAGCGCCATCCATCAACCCTACCTGTAATCAGTGATGCCA 

AGCAATACGAAAAGGTATTCAGCTACGCAAAGTAGAAGAGCAGCGTGAACAGGAAGCT 

ATTGAAAACGATGTTGCCACCATCCTGTCTCGCCGTATTGCTGTTGAATATAGTGA 

CAGAATTTGATGAAGTAGATTGGTTGGAGTAAGAAAAATGCA 

ATGTCOTTTGTGGTGCTTGTTCCTTGAAAATGTTTGGTCATTCTAGTGTT 

TAAATGACCCTTTTCCTCCATAACTTTTGATTT CTAAGGAAAATATTAG CATACATTTCAAACTAAATGT 
TTTACAGTGGCTTATCTTTTTTTTCCCC^ 

CATGGACAGCTGTTGTTAGAGTAGCAGATTCAGTTTTTTGATATATCTTAATTO 
TAATTTAAAGAAAGCAACTGAAATTGAAATCI^GAGGGCAGCTGTATCTACTAATGAGCCTT^ 
TCCTGATGTTTTAAAAGAAGAAA(^CTGCCTTGATTATACGAATACACTCAGAAAGTACATTTAGCTTGT 
AGTGTTGAATTCTCTTAAAGGAATGCTTGAATTTTTTCATTATTGTTTTATTC 

TATTTGAATGTTTAGCAGTATCCCCTTCCCACTTATATATTGTGTGATATGATTTTGCTTGCCTATAGGA 
GTTA7VAAACTTTTCCATGTGAAATAOTCTGACTTAAACATACATC 
CAGTCTGATTTAATAAATGGTTCATTTTAAAAGTT - 

Human WASF1 mRNA sequence - var2 (public gi: 4927209) (SEQ ID NO: 376) 

ATGCCGCTAGTGAAAAGAAACATCGATCCTAGGCACTTGTGCCACACAGCACTGCCTAGAGGCATTAAGA 

ATGAACTGGAATGTGTAACCAATATTTCCTTGGC^AATATAATTAGACAACTAAGTAG 

TGCTGAAGATATATTTGGAGAAT^ATTCAATGAAGCACATAGTTTTTCCTTCAGAGTCAAOT 

GAACGTGTGGACCGTTTATCTGTTAGTGTTAGA(^GCTTGATCCAAAGGAAGAAGAAT^ 

ATATAACAATGAGGAAAGCTTTCCGAAGTTCTACAATT 

GCCTATTCCATTAGAGGAGACGTACGATGTTTGTGAACAGCCT 

AGAGATGATGGTAAAGAAGGTCTGAAGTTTTATACCAATCCTTCGTATTTCTTTGATCTATGGAAAGAAA 

AAATGTTGCAAGATACAGAGGATAAGAGGAAGGAAAAGAGGAAGCAGAAGCAGAAAAATCTAGATCGTCC 

TCATGAACCAGAAAAAGTGCCAAGAGCACCTCATGACAGGCGGCGAGAATGGCAGAAGCTG^ 

CCAGAGCTGGCTGAAGATGATGCTAATCTCTTACATAAGCATATTGAAGTTGCTAATGGCCCAGCCTCTC 

ATTTTGAAACAAGACCTC^GACATATOTGGATCATATGGATGGATCrrACTCACTTTCTGCCTTGCCATT 

TAGTCAGATGAGTGAGCTTCTGACTAGAGCTGAGGAAAGGGTATTAGTCAGACCACATGAACCACCTCCA 

CCTCGACCAATGCATGGAGCAGGAGATGCAAAACCGATACCC^CCTGTATCAGTTCTGCTACAGGTTTGA 

TAGAAAATCGCCCTCAGTCACCAGCTACAGGCAGAACACCT^ 

TCCACCACCTCTTCCATCTGCCTTGTC^CTTCCTCATTAAGAGCTTCAATGACTTCAACTCCTCCCCCT 
CCAGTACCTCCCCCACCTCCACCTCCAGCC^CTGCTTT^^ 

CTCTTC^GATTGCCCCTGGAGTTCTTCACCCAGCTCCTCCTCCAATTGCACCTCCTCTAGTACAGCCCTC 

TCCACCAGTAGCTAGAGCTGCCCCAGTATGTGAGACTGTACCAGTTCATCCACTCCCACAAGGTGAAGTT 

CAGGGGCTGCCTCCACCCCCACC^CCGCCTCCT^ 

CAGTTACAGCTCTTGCTCATCCTCCCTCTGGG^ 

TCCATTAATGCCTCCATCTCCTCCATCACAAGTTAT^ 

CTACCTGTAATCAGTGATGCGAGGAGTGTGCTACTGGAAGCAATACGAAAAGGTATTCAGCTA^ 

TAGAAGAGCAGCGTGAACAGGAAGCTAAGCATGAACGCATTGAAAACGATGTTGCCACCATCCTGTCTCG 

CCGTATTGCTGTTGAATATAGTGATTCGGAAGATGATTCAGAATTTGATGAAGTAGATTGGTTGGAGTAA 

GAAAAATGCATTGATAAATATTACAAAACTGAATGCAAATGTCCTTTGTGGTGCTTGTTCCTTGAAAATG 
TTTGGTCA 

Human WASF1 protein sequence - varl (public gi: 4507913) (SEQ ID NO: 389) 

MPIiVKRNIDPRHLCHTALPRGIKNELEWTNISIiANIIRQLSSLSKYAEDI 
ERVDRLSVSVTQIJDPKEEELSIjQDITMRKAF^ 

RDDGKEGLKFYTNPSYFFDLWKEKMIxQDTEDKRKEKRKQKQK3SniiDRP 

PEIJVEDDANLLHKHIEVANGPASHFETRPQTYVDHMDGSYSLSALPFSQMSELLTRAEERVLV^ 
PPPMHGAGDAKPIPTCISSATGLIENRPQSPATGRTPVFVSPTPPPPPPPLPSALSTSSLRASMTSTPPP 
PVPPPPPPPATALQAPAVPPPPAPLQIAPGVLHPAPPPrAPPLVQPSPPVARAAPVCETVPVHPLPQGEV 
QGLPPPPPPPPLPPPGIRPSSPVTVTALAHPPSGLHPTPSTAPGPHVPLMPPSPPSQVIPASEPKRHPST 
IjPVI SDARSVLLEAIRKGIQLRKVEEQREQEAKHERI ENDVATILSRRI AVEYSDSEDDSEFDEVDWLE 
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Unigene Name; HIP-55 Unigene 3CD; Hs . 183373 

Human EQP-55 mRNA sequence - varl (public gi: 6470260) (SEQ ID NO: 377) 

ATGGCGGCGAACCTGAGCCGGAACGGGCCAGCGCTGCAAGAG 
CCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAGCAATG^ 
GGGTGGCCTGGAGGAGATGGTGGAGGAGCTCAACAGCGGGAAGGTGATC 
GACCCCAACTCTGGACTGCCCAAATTTGTCCTCATCAACT^ 

AGGGAGCCTGTGCCAGCCACGTCAGCACCATGGC<^GCTTCCTGAAGGGGGCC(^ 

ACGGGCCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAGAAGGTGGCC^ 

AGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCCCAGGCCCCAGTGC^ 

AGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAGCTTCTGGGCCAAAGCAGAGAAGGAG^ 

GGAGAACCGTCGGCTGGAGGAAAAGCGGCGGGCCGAGGAGGCACAGCGGCAGCT 

GAGCGTGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCTAT 

AGAGGACGTGGGAGCAG CAGCAAGAAGTGGTTTCAAGGAACCGAAATGAG CAGGAGTCTGCCGTGCACCC 

GAGGGAGATTTTCAAGCAGAAGGAGAGGGCCATGTCCACGA.CCTC 

CTGAGGAGCCCCTTCCTGCAGAAGCAGCTCACCCAACCAGAGACCCACT 

CCATCTCAAGGCCC^GGGCAGATCTCCCTGCTGAGGAGCCGGCGCCCAGCACTCCTC 

GGCAGAAGAGGAGGCTGTGTATGAGGAACCTCCAGAGCAGGAGACCTTCTACGAGCAGCCCCCACTGGTG 

CAGCAGCAAGGTGCCGGCTCTGAGCACATTGACCACCACA 

TCTGTGCCCGTGCCCTGTACGACTACCAGG<^GCGGACGACACAGAGATCTC(^TGACCCCGAGAACCT 

CATCACGGGCATCGAGGTGATCGACGAAGGCTGGTGGCGTGGCTATGGGCCGGATGGCCATTTTGGCATG 

TTCCCTGCCAACTACGTGGAGCTC^TTGAGTGAGGCTGAGGGCGGCCGCTAGACTAGTCTAGAGAAAAAA 
C 

Human HiP-55 mKNA sequence - var2 (pubHc gi: 8885629) (SEQ ID NO: 378) 

GAAGCTACAGCAGCGGCGCGGAGACTGCGGGGCX5GGCCATGGCGGCGAACCTGAGCCGGAACGGGCCAGC 

<3CTGCAAGAGGCCTACGTGCGGGTGGTCACCGAGAAGTCCCCGACCGACTGG 

GGCAACAGCAATGACATCCGCGTGGCTGGCACAGGGGAGGGT 

ACAGCGGGAAGGTGATGTACGCCTTCTGCAGAGTGAAGGACCCCAACTCTGGACTGCCCAAATTTGTCCT 
CATCAACTGGACAGGCGAGGGCGTGAACGATGTGCGGAAGGGAGCCTGTGCCAGCCACGTCAGCACCATG 
GCCAGCTTCCTGAAGGGGGCCCATGTGACCATCAACGCACGGGCCGAGGAGGATGTGGAGCCTGAGTGCA 
TCATGGAGAAGGTGGCCAAGGCTTCAGGTGCCAACTAC^^ 

CGTGGGACCCCAGGCCCCAGTGGGCTCTGTGTACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTT 
GGTAAAGAC^GCraCTGGGCCAAAGCAGAGAAGGAGGAGGAGAACCGTCGGCTGGAGGAAAAGCGGCG^ 
CCGAGGAGGCACAGCGGCAGCTGGAGCAGGAGCGCCGGGAGCGTGAGCTGCGTGAGGCTGCACGCCGGGA 
GCAGCGCTATCAGGAGC^GGGTGGCGAGGCCAGCCCCCAGAGGACGTGGGAGCAGCAGCAAGAAGTGGTT 
TCAAGGAACCGAAATGAGCAGGAGTCTGCCGTGCACCCGAGGGAGATTTTCAAGCAGAAGGAGAGGGCCA 
TGTCCACCACCTCCATCTCCAGTCCTCAGCCTGGCAAGCT^ 

CCAACCAGAGACCCACTTTGGCAGAGAGCCAGCTGCTGCCATCTCAAGGCCCAGGGCAGATCTCCCTGCT 

GAGGAGCCGGCGCCCAGCACTCCTCCATGTCTGGTGCAGGCAGAAGAGGAGGCTGTGTATGAGGAACCTC 

CAGAGCAGGAGACCTTCTACGAGCAGCCCCCACTGGTGCAGCAGCAAGGTGCTGGCTCTGAGCACATTGA 

CCACCACATTCAGGGCCAGGGGCTCAGTGGGCAAGGGCTCTGTGCCCGTGCCCTGTACGACTACCAGGCA 

GCCGACGACACAGAGATCTCCTTTGACCCCGAGAACCTCATCACGGGCATCGAGGTGATCGACGAAGGCT 

GGTGGCGTGGCTATGGGCCGGATGGCCATTTTGGCATGTTCCCTGCCAACTACGTGGAGCTCATTGAGTG 

AGGCTGAGGGCACATCTTGCCCTTCCCCTCTCAGACATGGCTTCCTTATTGCTGGAAGAGGAGGCCTGGG 

AGTTGACATTCAGCACTCTTCCAGGAATAGGACCCCCAGTGAGGATGAGGCCTCAGGGCTCCCTCCGGCT 

TGGCAGACTCAGCCTGTCACCCCAAATGCAGCAATGGCCTGGTGATTCCCACACATCCTTCCT 

CCGACCCTCCCAGACAGCTTGGCTCTTGCCCCTGACAGGATACTGAGCCAAGCCCTGCCTGTGGCCAAGC 

CCTGAGTGGCCACTGCCAAGCTGCGGGGAAGGGTCCT 

CTGCATTTATTTGCCTTTTTTCTTTTTCTCTTGCTTCTAAGGGGTGGTGGCCACCACTGTTTAGAATGAC 
CCTTGGGAACAGTGAACGTAGAGAATTGTTTTTAGCAGAGTTTGTGACCAAAGTCAGAGTGGATCATGGT 
GGTTTGGCAGCAGGGAATTTGTCTTGTTGGAGCCTGC^ 

GCCTGGGCTATGGGAAGTGGGGATGCAGATGGCCAAGCTCCCACCCTGGGTATTCAAAAACGGCAGACAC 
AACATGTTCCTCCACGCGGCTCAAAAAAAAAAAAAAAAAAA 

Human HIP-55 mRNA sequence - var3 (public gi: 8917572) (SEQ ID NO: 379) 

ATGGCGGCGAACCTGAGCCGGAACGGGCCAGCGCTGCAAGAGGCCTACGTGCGGGTGGTCACCGAGAAGT 
CCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAGCAATGACATC 

GGGTGGCCTGGAGGAGATGGTGGAGGAGCTCAACAGCGGGAAGGTGATGTACGCCTTCTGCAGAGTGAAG 
GACCCCAACTCTGKSACTGCCCAAATTTGTTC^ 
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AGGGAGCCTGTTCCAGCCACGTCAGCACCATGGCCAGC^ 

ACGGGC03AGGAGGATGTGGAGCCTGAGTGCATCA 

AGCTTTCACAAGGAGAGTGGCCGCTTCGAGGACGTGGGAC^ 

AGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAGCTT 

GGAGAACCGTCXXX^TGGAGGAAAAGCGGCGGGCCGAGGAGGCAGAGTO 

GAGCGTGAGCTGCGTGAGGCIX3GACGCCGGGAGCAGCGCTA 

AGAGTACGTGGGAGCAGCAGCAAGAAGTGGTTTCAAGGAACCGAAATGAGCAGGAGTCT 

GAGGGAGATTTTCAAGCAGAAGGAGAGGGCCATGTCCACCACCTCCA 

CTGAGGAGCCCCTTCCTGCAGAAGCAGCTCACCCAACCAGAGACCCACTCT 

CCATCTCAAGGCCCAGGGCAGATCTCCCTGCTGAGGAGCCGGC^ 

GGCAGAAGAGGAGGCTGTGTATGAGGAACCTCCAGAGCAGGAGACCTTCTA 

CAGCAGGAAGGTGCTGGCTCTGAGCACATTGACGAC<^ 

TCTGTGCCCGTGCCCTGTACGACTACCAGGCAGCCGACGACACAGAGA 

CATCACGGGCATCGAGGTGATCKSACGAAGGCTGGTGGCGTGGCT^ 

TTCCCTGCCAACTACGTGGAGCTCATTGAGTGA 

Human HIP-55 mRNA sequence - var4 (public gi: 10121214) (SEQ ID NO: 380) 

GGGGCGGGCCATGGCGGCGAACCTGAGCCGGAACGGGCCAGCG 
ACCGAGAAGTCCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCA^ 

GCACAGGGGAGGGTGGCCTGGAGGAGATGGTGGAGGAGCTCAACAGCGGGAAGGTGATGTACGCCTTCTG 
CAGAGTGAAGGACCCCAACTCTGGACTGCCCAAATTTGTTCTCATCAACTGGAGAGGCGA 
GATGTGCGGAAGGGAGCCTGTTCC^GCCACGTCAGCACCATGGCCAGCTTC^ 
CCATCAACGGAGGGGCCGAGGAGGATGTGGAGCCTGAGTGCAT 

TGCCAACTACAGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCCCAGGCCCCAGTGGGCT 
GTGTACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAT^GAGAGCTTCTGGGCCAAAGCAG 
AGAAGGAGGAGGAGAACCGTCGG CTGGAGGAAAAGCGGCGGG CCGAGGAGGCACAG CGGCAGCTGGAGCA 
GGAGCGCCGGGAGCGTGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCTATCAGGAGCAGGGTGGCGAG 
GCCAGCCCCCAGAGTACGTGGGAGCAGCAGCAAGAAGTGGTTTOUV 

CCGTGCACCCGAGGGAGATTTTCAAGCAGAAGGAGAGGGCCATGTCCACCACCTCCATCTCC^GTC 
GCCTGGCAAGCTGAGGAGCCCCTTCCTGCAGAAGCAGCTCAC^ 

CCAGCTGCTGCCATCTCAAGGCCCAGGGCAGATCTCCCTGCTGAGGAGCCGGCGCCCAGCACTCCTCCAT 
GTCTGGTGCAGGCAGAAGAGGAGGCTGTGTATGAGGAACCTCCAGAGCAGGAGACCTTCTACGAGCAGCC 
CCCACTGGTGCAGC^GCAAGGTGCTGGCTCTGAGCACATTGACC^ 

GGGCAAGGGCTCTGTGCCCGTGCCCT^TACGACTACCAGGCAGCCGACGACACAGAGATCTCCT^ 

CCGAGAACCTCATCACGGGCATCGAGGTGATCGACGAAGGCTGGTGGCGTGGCTATGGGCCGGATGGCCA 

TTTTGGCATGTTCCCTGCCAACTACGTGGAGCTCATTGAGTC 

TCTCAGAC^TGGCTTCCTTATTGCTGGAAGAGGAGGCCTGGGAGTTGACATTCAGCACTCTTCCAGGAAT 
AGGACCCCCAGTGAGGATGAGGCCTCAGGGCTCCCTCCGGCTTGGG 

CAGCAATGGCCTGGTGATTCCCACACATCCTTCCTGCATCCCCCGACCCTCCCAGAGAGCTTGGCTCTTG 

CCCCTGAC^GGATACTGAGCCAAGCCCTGCCTGTGGCCAAGCCCTGAGTGGCCACTGCCAAGCTGCGGGG 

AAGGGTCCTG7VGCAGGGGCATCTGGGAGGCTCTGGCTGCCTTCTGCATTTATTTGCCTTTTTTCTTTTTC 

TCTTGCTTCTAAGGGGTGGTGGCCACCACTGTTTAGAATGACCCTTGGGAACAGTGAACGTAGAGAATTG 

TTTTTAGCAGAGTTTGTGACCAAAGTCAGAGTGGATCATGGTGGT^ 

GGAGCCTGCTCTGTGCTCCCCACTCCATTTCTCTGTC 

ATGGCCAAGCTCCCACCCTGGGTATTCAAAAACGGCAGACACAAC^^ 

TGCCTGCAGGCCCGAGTGTGTGCCTC^AOTGATTCTGACTTCAGGAAAAGTAACACAGAGTGGCCT 

CTGTTGTCTTCCCCTATTTTCTGTCCCAGCTCATCCGXGTCTCTGAAGAATAAATATGGTTTTGGAAAAA 

AAAAAAAAAA 

Human HIP-55 mRNA sequence - var5 (public gi: 10441969) (SEQ ID NO: 381) 

GACCATCAACGCACGGGCCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAGAAGGTGGCCAAGGCTTCA 

GGTGCCAACTACAGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCCCAGGCCCCAGTGGGCT 

CTGTGTACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAGCTTCTGGGCCAAAGC 

AGAGAAGGAGGAGGAGAACCGTCGGCTGGAGGAAAAGCGGCGGGCCGAGGAGGCACAGCGGCAGCTGGAG 

CAGGAGCGCCGGGAGCGTGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCTATCAGGAGCAGGGTGGCG 

AGGCCAGCCCCCAAAGGACGTGGGAGCAGCAGCAAGAAGTGGTTTCAAGGAACCGAAATGAGCAGGAGTC 

TGCCGTGCACCCGAGGGAGATTTTCAAGCAGAAGGAGAGGGCCATGTCCACCACCTCCATCTCCAGTCCT 

CAGCCTGGCAAGCTGAGGAGCCCCTTCCTGCAGAAGCAGCTCACC^ 

AGCGAGCTGCTGCCATCTCAAGGCCCAGGGCAGATCTCC 

ATGTCTGGTGCAGGCAGAAGAGGAGGCTGTGTATGAGGAACCTCCAGAGCAGGAGACCTTCTACGAGCAG 
CCCCCACTGGTGCAGCAGCAAGGTGCTGGCTCTGA 

GTGGGCAAGGGCTCTGTGCCCGTGCCCTGTACGACTACCAGGCAGCCGACGACACAGAGATCTCCTTTGA 
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ccccgagaacctcatcacgggcatcgaggtgatcgacgaaggctggtgg cgtggctatgggccggatggc 

cattttggcatgttccctgccaactacgtggagct 

cctctcagacatggcitccttattgctggaagaggaggcctgggag 

ataggacccccagtgaggatgaggcctcagggctccctccggcttggcagactca 

tgcagcaatggcctggtgattcccacacatccitcctgcatcccccgaccctccc^ 

tgcccctgacaggatactgagccaagccctgcctgtggcc^gcc 

ggaagggtcctgagcaggggcatctgggaggctctggctggcttotgca 

tctcttgcttctaaggggtggtggcgaccactc 

tgtttttagcagagtttgtgacgaaagtcagagtggatcatggtggtttggg^ 
ttggagcctgctctgtgctccccaotccatttotctgtccct 

AGATGGCCAAGCTCCCACCCTGGGTATTCA^ 
GATGCCTGCAGGCCCCAGTGTGTGCCTCAACTGATTCTGACTT 
GCCTGTTGTCITCCCCrATTTTCTGTOCCAGCTCATCCGTGTCT 
ACGAAAAAAAAAAAAAAAAAAAAA 

Human HIP-55 mRNA sequence - var6 (public gi: 14041995) (SEQ ID NO: 382) 

AGCGGCGCGGAGACTGCGGGGCGGGCCATGGCGGCGAACCT 

CCTACGTGCGGGTGGTCACCGAGAAGTCCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAGCAA 
TGACATCCGCGTGGCTGGCACAGGGGAGGGTGGCCTGGAGGAGATGGTGGAG 
GTGATGTACGCCTTCTGCAGAGTGAAGGACCCCAACTCTGGACT 
GAGGCGAGGGCGTGAACGATGTGCGGAAGGGAGCCTGTGCGAGCCACGTC^ 

GAAGGGGGCCCATGTGACCATCAACGCACGGGCCGAGGAGGATGTGGAGCCTGAGTGCATCATGG 
GTGGCCAAGGCTTCAGGTGCCAACTACAGCTTCCACAAGGAGAGTGGCCG 

AGGCCCCAGTGGGCTCTGTGTACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAG 

CTTCTGGGCCAAAGCAGAGAAGGAGGAGGAGAACCGTCGGCTGGAGGAAAAGCGGCGGGCCGAGGAGGCA 

CAGCGGCAGCTGGAGCAGGAGCGCCGGGAGCGTGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCTATC 

AGGAGCAGGGTGGCGAGGCCAGCCCCCAGAGCAGGACGTGGGAGCAGCAGCAAGAAGTGGTTTCAAGGAA 

CCXxAAATGAGCAGGGGTCAAC^TGTGCTTCCCTCCAGGAGTCTGCCG 

CAGAAGGAGAGGGCCATGTCCACCACCTCCATC^ 

TGCAGAAGCAGCTCACCCAACCAGAGACCCACTTTGGCAGAGAGCCAGCTGCTGCCATCTCAAGGCCCAG 

GGCAGATCTCCCTGCTGAGGAGCCGGCGCCCAGC^CTCCTCCATGTCTGGTGGAGGCAGAAGAGGAGGCT 

GTGTATGAGGAACCTCCAGAGCAGGAGACCTTCTACGAGC^ 

GCTCTGAGCACATTGACCACCACATCCAGGGCCAGGGGCT 

GTACGACTACCAGGCAGCCGACGAGACAGAGATCTCCT 

GTGATCGACGAAGGCTGGTGGCGTGGCTATGGGCCGGATGGCCATTTTGGCA.TGTTCCCTGCCAACTACG 
TGGAGCTCATOGAGTGAGGCTGAGGGCACATCTTGCC^ 

GAAGAGGAGGCCTGGGAGXTGACATTCAGCACTCTTCCAGGAATAGGACCCCCAGTGAGGATGAGGCCTC 

AGGGCTCCCTCCGGCTTGGC^GACTCAGCCTGTCACCCCAAATGCAGCAATGGCCTGGTGATTCCCACAC 

ATCCTTCCTGCATCCCCCGACCCTCCCAGAC^GCTTGGCTCTTGCCCCTGACAGGATACTGAGCCAAGCC 

CTGCCTGTGGCCAAGCCCTGAGTGGCCACTGCCAAGCTGCG<^ 

AGGCTCTGGCTGCCTTCTGCATTTATTTC 

CACTGTTTAGAATGACCCTTGGGAACAGTGAACGTAGAGAATTGTTTTTAGCAGAGTTTGTGACCAAAGT 
CAGAGTGGATCATGGTGGTTTGGCAGCAGGGAATTTGTCTTGTTGGAGCCT 

ATTTCTCTGTCCCTCTGCCTGGGCTATGGGAAGTGGGGATGCAGATGGCCAAGCTCCCACCCTGGGTATT 
CAAAAACGGCAGACACAACATGTTCCTCCACGCGGCTCACTCGATGCCTGCAGGCCCCAGTGTGTGCCTC 
AACCGATTCTGACTTCAGGAAAAGTAACACAGAGTGGC 

Human HEP-55 mRNA sequence - var7 (public gi: 15079722) (SEQ ID NO: 383) 

GGCACGAGGGCGGAGACTGCGGGGCGGGCGATGGCGGCGAAC^ 

AGGCCTACGTGCGGGTGGTCACCGAGAAGTCCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAG 

CAATGAC^TCCGCGTGGCTGGC^CAGGGGAGGGTGGCCTGGAGGAGATGGTGGAGGAGCTCAACAGCGGG 

AAGGTGATGTACGCCTTCTGCAGAGTGAAGGACCCCAACTCTGGACTGCCCAAATTTGTCCTCATCAACT 

GGACAGGCGAGGGCGTGAACGATGTGCGGAAGGGAGCCTGTGCCAGCCACGTCAGCACCATGGCCAGCTT 

CCTGAAGGGGGCCCATGTGACCATCAACGCACGGGCCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAG 

AAGGTGGCCAAGGCTTCAGGTGCCAACTACAGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGAC 

CCCAGGCCCCAGTGGGCTCTGTGTACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGA 

CAGCTTCTGGGCCAAAGCAGAGAAGGAGGAGGAGAACCGTCGGCTGGAGGAAAAGCGGCGGGCCGAGGAG 

GC^C^GCGK3a^GCTGGAGCAGGAGCGCCGGGAGCGTGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCT 

ATCAGGAGCAGGGTGGCGAGGCCAGCCCCCAGAGCAGGACGTGGGAGCAGCAGC^ 

GAACCGAAATGAGCAGGAGTCTGCCGTGCACCCGAGGGAGATTTTCAAGCAGAAGGAGAGGG 

ACCACCTCCATCTCCAGTCCTCAGCCTGGCAAGCTGAGGAGCCC 

C^GAGACCCACTTTGGC^GAGAGCCAGCTGCTGCCATCTC^GGCCCAGGGCAGATCTCCCTGCTGAGGA 
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GCCGGCGCCXAGCACTCCTCCATGTCTGGTGCAGGCAGAAG^ 

CAGGAGACCTTCTACGAGCAGCCCCCACTGGTGCAGC^ 

ACATTCAGGGCCAGGGGCTCAGTGGGGAAGGG 

CGACACAGAGA^TCTCCTI^GACCCCGAGAACCTCATCACGGG 

CGTGGCTATGGGCCGGATGGCCATTTTGGCATGTTCCCT^ 

GAGGGCACATCTTGCCCTTCCCCTCTGAGACATGGCTTCCTT 

ACATT(^GCACTCTTCCAGGAATAGGACCCCCAGTGAGGATGAGGCCTCAGGG^ 

GACTCAGCCTGTCACCCCAAATGCAGCAATGGCCTGGTGATTCCC^ 

CCTCCCAGACAGCTTGGCTCTTGCCCCTGACAGGATACTGAGCCAAGC^ 

GTGGCC^CTGCGAAGCTGCXaGGGAAGGGTCCTGAGGAGGGGGATCT 

TTTATTTGCCTTTTTTCTTTTTCTCTTGCTTCT 

GGAACAGTGAACGTAGAGAATTGTTTTTAGCAGAGTTTC 

GGCAGCAGGGAATTTGTCTTGTTGGAGCCTGCTCTGTGCTCCCCACTCCATTTCTCTGTCCCTCTGCCTG 

GGCTATGGGAAGTGGGGATGCAGATGGCCAAGCTCCCACCCTGGGTATTCA^ 

GTTCCTCCACGCGGCTCACTCGATGCCTGCAGGCCCCAGTGTC 

AAGTAACAGAGAGTGGCCTTGGCCTGTTGTCTTCCCCTATTTTCTGTCCC^ 

GAACAAATATGCTTTTGGACCACGAAAAAAAAAAAAAAAA^^ 

Human HEP-55 mRNA sequence - var8 (public gi: 21619482) (SEQIDNO: 384) 

aSGGCCATGGOSGCGAACCTGAGCCGGAACGGGC 

AGAAGTCCCCGACCGACTGGGCTCTCTTTACCTATGAAGGCAACAGCAATGACATCCGCGTG^ 
AGGGGAGGGTGGCCTGGAGGAGATGGTGGAGGAGCTCAACAGCGGGAAGGT^ 

GTGAAGGACCCCAACTCTGGACTGCCCi^AATTTGTCCTCATCAACTGGACAGGCGAGGGCGTGAACGATG 

TGCGGAAGGGAGCCTGTGCC^GCCACGTCAGCACCATGGCCAGCTTCCTGAAGGGGGCCCATGTGACCAT 

CAACGCACGGGCCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAGAAGGTGGCCAAGGCTTCAGGTGCC 

AACTACAGCTTTCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCCC^GGCCCCAGTGGGCTCTGTGT 

ACCAGAAGACCAATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAGCTTCTGGGCCAAAGCAGAGAA 

GGAGGAGGAGAACCGTCGGCTGGAGGAAAAGCGGCGGGCCGAGGAGGCACAGCGGCAGCTGGAGCAGGAG 

CGCCGGGAGCGTGAGCTGCGTGAGGCTGCACGCCGGGAGCAGCGCTATCAGGAGCAGGGTGGCGAGGCCA 

GCCCCCAGAGGACGTGGGAGCAGCAGCAAGAAGTGGTTTCAAGGAACCGAAATGAGCAGGAGTCTGCCGT 

GGACCCGAGGGAGATTTTCAAGCAGAAGGAGAGGGCCATGTCCACGACCTCC^ 

GGGAAGCTGAGGAGCCCCTTCCTGCAGAAGCAGCTCACC 

CTGCn^CCATCTCAAGGCCCAGGGCAGATCTCCCTGCTGAGGAGCCGGCGCCCAGCACTCCTCCATGTCT 

GGTGC^GGCAGAAGAGGAGGCTGTGTATGAGGAACCTCC^^ 

CTGGTGCAGCAGCAAGGTGCTGGCTCTGAGCACATTGAC 

AAGGGCTCTGTGCCCGTGCCCTGTACGACTACCAGGCAGCCGACGACA(^GAGATCTCCTTTGACCCCGA 
GAACCTCATCACGGGCATCGAGGTGATCGACGAAGGCTGGTGGCGTGGCTATGGGCCGGATGGCCATTTT 
GGCATGTTCCCTGCCAACTACGTGGAGCTCATTGAGTGAGGCTGA^^ 

AGACATGGCTTCCTTATTGCTGGAAGAGGAGGCCTGGGAGTTGACATTCAGCACTCTTCCAGGAATAGGA 
CCCCCAGTGAGGATGAGGCCTCAGGGCTCCCTCCGGGTTGGCAGACTCAGCCTGTCACCCCAAATGCAGC 
AATGGCCTGGTGATTCCCACACATCCTTCCTGCATCCCCCGACCCTCCCAGACAGCTTGGCTCTTGCCCC 
TGACAGGATACTGAGCCAAGCCCTGCCTGTGGCCAAGCCCTGAGTGGCCACTGCCAAGCTGCGGGGAAGG 
GTCCTGAGCAGGGGCATTTGGGAGGCTCTGGCTGCCTTCTGCATTTATTTGCCTTTTTTCTTTTTCTCTT 
GCTTCTAAGGGGTGGTGGCCACCACTGTTTAGAATGACCCTTGGGAACAGTGAACGTAGAGAATTGTTTT 
TAGCAGAGTTTGTGACCAAAGTCAGAGTGGATCATGGTGGTTTGGCAGC^GGGAATTTGTCTTGTTGGAG 
CCTGCTCTGTGCTCCCCACTCGATTTTTCTGTCCCTCTGCCTGGGCTGTGGGAAGTGGGGATGCAGATGG 
CCAAGCTCCCCCCCTGGGTATTCAAAAACGGCAGACACAACATGTTCCTCCACGCGGCTCACTCGATGCC 
TGCAGGCCCCAGTGTGTGCCTCAACTGATTCTGACTTCAGGAAAAGTAACACAGAGTGGCCTTGGCCTGT 
TGTCTTCCCCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Human HIP-55 raRNA sequence - var9 (public gi: 23959038) (SEQ ID NO: 385) 

GGCACGAGGATTTGACAGATGAATGTATAGCAGTCATTGGGAAACTCCACAGCTC^TGTTTTCCTCATAG 

TAGATGTGTGCTCCCATCTCCATGGCTTTGTCCCTCACAACCCCCACCCCATGGTAAGTCAGGCCAGTGT 

CCTCCCAGCTGCAGAGCTGAGAAGGCTGCAC^GTTGCCTACTGAGAACCTGCCTAGTGGGTCAGAGCAAA 

GTGAGAACGGGCCTGTGCCCACCCACAGTGTTACTGTCAGCCCAAGCTCTTTGGGATGTAGTGGAAAGTC 

ATGGTGGATACGGTGAGGAGAGATGGAACCGAAGGTGCTGGCTACAGAGCTCACTTGTGTTTCGTTTCAG 

GGCTCTCTTTACCTATGAAGGCAAC^GCAATGACATCCGCGTGGCTGGCACAGGGGGTGAGTATGACTCC 

AAATGGACTCAGGGACACCAGGAGGTAGGAGGGTGACGACGAGGGGTCAGCGCACTCAGCTGTCTTGGTC 

CACTGAGCC^CATGGGGCTTCCAGTGCTCACTGGC<^CTTCTGGCAGGCCTTAGGTTTGAGATA 

AGTGAAAACATTCCTCCTTGGTTCTCCTTCCCTCTGGGTGCAGGGGAGTGCTTTCTCTTTGTCTACTTGG 

GGAGAGCTGAGAGGGAAACAGGCCTCTCCAGCTTGTGGGCAGCCTGCGTTGGGAGCTGCGGTGGGAAGCT 

CACCAGTCCCAGAACTGGTGCTGGTGGAAAGAAAGTCCACAGACATATCTTCTTCTCCCTTTGTCCTGCC 
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CTGGTCTTGTGCCGAGTGCTTGCA^ 

CGGGGGAGGCTCTGGAGGTCTCTACAGGAAGGACAGGCT 

TAGGCTTCGTCAGAGTGTGACCTGTGGGCCCCTCAGCT 

CCTGACCCCTCCCTCTGTC^^ 

CGGTTTGATGTCTGTTGTTGGTAGAAAGCCCCTTC 

GAAGGATTGCCATTGAACAGTAGACATGTGGTGT^^ 

AGAGTTTC(^TCATCCCCAGTGGCAC^GGACAGGGCTCIX^ 

AGTGGGGCTCATGACTGCCTCCACTCATACGGAGCCCTGTAGATGAAATACCTGA^ 

TATAACCTGGAAAAGTTTGTGAGGGCTAAGCCTCAGTG 

GTGCTCCCCCTGTCCC(^T<^CC^ 

GTTCAATCTGGTCTGGCCGTCTGGGTGGCCCTC 

TAGAGGCCCTGCTATTCCGGGCCCTTTTGCAAOT 

(^TTTGGAAGTCTGGTGGCTGGTTTGCCGTGGAAA 

TTAGAGAAGTGGTTTTTAAACGAGTGTGGGTAAAAAAAT^ 

CTAGGTCCGACCCAGCTCTCATCAATCAGTTTAGTGAGGGTGGTGCCC^ 

CCCTAGAAAGATTCTGATACAGGTAGAGGTGAGAAGCCCTGGTTTAGAAGCAGCTCGGCCTCCCT^ 

GTGGGACCAGGGCCAGCAGGGAATGTCAGGGCCACCCCTGA 

GCC1K3GAGGAGATGGTGGAGGAGCTCAACAGCGGGAAGGT 

CAACTCTGGACTGCCCAAATTTGTCCTCATCAACTGGACAGGCGAGGGCGTGAACGATGTGCGGAA 

GCCTGTGCCAGCCACGTCAGCACCATGGCCAGCTTCCTGAA^ 

CCGAGGAGGATGTGGAGCCTGAGTGCATCATGGAGAAGGTGGC 

TCACAAGGAGAGTGGCCGCTTCCAGGACGTGGGACCCCAGGCCCCAGTGGGCTCTGTGTACCAGAAGACC 

AATGCCGTGTCTGAGATTAAAAGGGTTGGTAAAGACAGCTTCT 

CGGGGCATGCTGGGCACGTGGGAGTGTTCTGCTTGCTGT 

AGCATCCACTCTCCTTGGTGCCCATTACAGATGGTCACACTGAGGCTCGGGTAAGTTAAGCCACAAGGCT 

AATGATCGACTGGCTCTGGTGCCCGTCTTTGGCGATGTGCOTAAAACTCAGTCCTGGGCAGGGGATTAGG 

CTGAAGTGGCAGCATAGGGCTGAGCGGGCAGTGGCTCTCCCTG(^GAAGGAGGAGGAGAACCGTCGGCTG 

GAGGAAAAGCGGCGGGCCGAGGAGGCAC AGCGGCAG CTGGAG CAGGAG CGC CGGGAGCGTGAG CTGCGTG 

AGGKITGCACGCCGGGAGGAGCGCTATCAGGAGCAGGGTGGCGAGGCCAGCCCCCAGAGGACGTGGGAG^ 

GCAGCAAGAAGTGGTTTCAAGGAACCGAAATGAGCAGGAGTCTGCCGTGCACCCGAGGGAGATTTTCAAG 

CAGAAGGAGAGGGCCATGTCCACCACCTCCATCTCCAGTCCTCAGCCTGGCAAGCTGAGGAGC 

TGCAGAAGCAGCTC^CCCAACCAGAGACCCACTTTGGCAGAGAGCCAGCTGCTGCCATOT 

GGCAGATCTCCCTGCTGAGGAGCCGGCGCCC^GCACTCCTCCATGTCTGGTGCAGGCAGAAGAGGAGGCT 

GTGTATGAGGAACCTCCAGAGCAGGAGACCTTCTACGAGCAGCCCCCACTGGTGCAGCAGCAAGGTGCTG 

GCTCTGAGCACATTGACCACCACATTGAGGGCCAGGGGCTC^ 

GTACGACTACCAGGCAGCCGACGACAGAGAGATCTCCTTTGACCCCGAGAACCTCATCACGGGCATCGAG 
GTGATCGACGAAGGCTGGTGGCGTGGCTATGGGCCGGATGGCCATTTTGGCATGTTCCCTGCCAACTACG 
TGGAGCTGATTGAGTGAGGCTGAGGGCACATCTTGCCCTTCCCCTCTC^GACATGGCTTCCTTA 
GAAGAGGAGGCCTGGGAGTTGACATTCAGCACTCTTCCAGGAATAGGACCCCCAGTGAGGATGAGGCOTC 
AGGGCTCCCTCCGGCTTGGCAGACTCAGCCTGTCACCCCAAATGCAGCAATGGCCTGGTGATTCCCACAC 
ATCCTTCCTGCATCCCCCGACCCTCCCAGACAGCTTGGCTCTTGCCCCTGACAGGATACTGAGCCAAGCC 
■CTGCCTGTGGCCAAGCCCTGAGTGGCCACTGCCAAGCTGCGGGGAAGGGTCCTGAGCAGGGGC^TCTGGG 
AGGCTCTGGCTGCCTTCTGCATTTATTTGCCTTTTTTCTTTTTCTCTTGCTTCTAAGGGGTGGTGGCCAC 
CACTGTTTAGAATGACCCTTGGGAACAGTGAACGTAGAGAATTGTTTTTAGCAGAGTTTGTGACCAAAGT 
CAGAGTGGATCATGGTGGTTTGGC^GK^GGGAATTTGT 

ATTTCTCTGTCCCTCTGCCTGGGCTATGGGAAGTGGGGATGCAGATGGCCAAGCTCCCACCCTGGGTATT 

CAAAAACGGCAGACACAACATGTTCCTCCACGCGGCTCACTCGATGCCTGCAGGCCCCAGTGTGTGCCTC 

AACTGATTCTGACTTCAGGAAAAGTAACACAGAGTGGCAAAAAAAJ^ 

AAAAAAAAAAA 

Human HJP-55 protein sequence - varl (public gi: 21619483) (SEQ ID NO: 390) 

MAANLS RNGPALQEAYVRWTE KS PTDWA&FTYEGNSND I RVAGTGEGGIjEEMVEELNSGKVMYAFCRVK 
DPNSGL P KFVL I NWTGEGVNDVRKGACASHVS TMAS FLKGAHVT INARAE EDVEPEC I ME KVAKAS GANY 
SFHKESGRFQDVGPQAPVGSVYQKTNAVS E I KRVGKDS FWAKAEKEEENRRLEEKRRAEEAQRQLEQERR 
ERELREAARREQRYQEQGGEASPQRTWEQQQEVVSRNRNEQESAVHPREIPKQKERAMSTTSISSPQPG^ 
LRSPPDQKQLTQPETHFGREPAAAISRPRADLPAEEPAPSTPPCLVQAEEEAVYEEPPEQETFYEQPPIjV 
QQQGAGSEHIDHHIQGQGLSGQGLCARALYD YQAADDTE I S FDPENL I TGI EV IDEGWWRGYGPDGHFGM 
FPANYVELIE 

Human HEP-55 protein sequence - var2 (public gi: 15079723) (SEQ ID NO: 391) 

MAANLS RNGPALQEAYVRVVTEKSPTOWALFTYEGNSNDIR 
DPNSGLPKFVLlNWTGEGVNDVRKGACASHVSTMASFLKGi^ 
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S FHKESGRFQDVGPQAPVGSVYQKTNAVSE IKRVGKDS FWAKAE KEEENRRLEEKRRAEEAQRQLEQERR 

ERELREAARREQRYQEQGGEAS PQSRTWEQQQEWSRNRNEQES AVHPRE I FKQKERAMSTTS I S S PQPG 

KLRSPFLiQKQLTQPETHFGREPAAAISRPRADIjPAEEPAPSTPPCIjVQAEEEAVyE 

VQQQGAGSEHIDHHIQGQGLSGQGIjCARAIiYDYQAADDTEISFDPENIjITG 

MFPANYVELIE 

Human HLP-55 protein sequence - var3 (public gi: 14041996) (SEQ ID NO: 392) 

MAANLSRNGPALQEAYVRVWEKS PTDWALFT^ 
DPNSGLPKFVLINWTGEGVNDVRKGACASHVSTMASFLKGAHV^ 

S FHKESGRFQDVGPQAPVGS VYQKTNAVSEI KRVGKDS FWAKAE KEEENRRLEE KRRAEEAQRQLEQERR 
ERELREAARREQRYQEQGGEAS PQSRTWEQQQEVVSRNRNEQGSTCASLQESA^ 
SISSPQPGKLRSPFLQKQLTQPETHFGREPAAAISRPRADLPAEEPAPSTPPC^ 

TFYEQPPLVQQQGAGSEHIDHHIQGQGLSGQGLCARALYDYQAADDTEISFDPENIilTGIEVIDEGVWRG 
YGPDGHFGMFPANYVELTE 

Human HIP-55 protein sequence - var4 (public gi: 10441970) (SEQ ID NO: 393) 

MEKVAKASGANYS FHKESGRFQDVGPQAPVGS VYQKTNAVSE I KRVGKDS FWAKAE KEEENRRLEEKRRA 
EEAQRQLEQERREREIjREAARREQRYQEQGGEAS PQRTWEQQQEWSRNRNEQESAVHPRE I FKQKERAM 
STTSISSPQPGKLRSPFLQKQLTQPETHFGREPAAAISRPRADLPAEEPAPSTPPCLVQAEEEAVYEEPP 
EQETFYEQPPLVQQQGAGSEHIDHHIQGQGLSGQGLCARAIiYDYQAADDTEISFDPENLITGIEVIDEGW 
WRGYGPDGHFGMFPANYVELIE 

Human HIP-55 protein sequence - var5 (public gi: 10121215) (SEQ ID NO: 394) 

MAANLSRNGPALQEAYVRWTEKS PTDWALFTYEGNSNDI RVAGTGEGGLEEWTOEIiNSGKVMYAFCRVK 
DPNSGLPKFVLXNWTGEGVNDVRKGACSSHVSTMASFLKGAHV^ 

S FHKESGRFQDVGPQAPVGS VYQKTNAVS E I KRVGKDS FWAKAE KEEENRRL EE KRRAEEAQRQLEQERR 
ERELREAARREQRYQEQGGEASPQSTWEQQQEWSRNRNEQESAVHPREIFKQKERAMSTTSISSPQPGK 
LRSPFLQKQLTQPETHFGREPAAAISRPRADLPAEEPAPSTPPCLVQAEEEAVYEEPPEQETFYEQPPLV 
QQQGAGSEHIDHHIQGQGLSGQGLCARALYDYQAADDTEISFDPENLITGIEVIDEGWWRGYGPDGHFGM 
FPANYVELIE 
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